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PATENT 

ATTORNEY DOCKET NO: CIT1350-1 

CHECKPOINT-ACTIVATING OLIGONUCLEOTIDES 

RELATED APPLICATIONS 

[0001] This application claims the benefit of priority to United States Application 
60/202,028, filed May 4, 2000, herein incorporated by reference in its entirety. 

FEDERAL GOVERNMENT SUPPORT 

[0002] The research described herein was supported by grant number GM43974, 
awarded by the National Institutes of Health. The federal government may have 
certain rights in the invention. 

FIELD OF THE INVENTION 

[0003] The present invention relates generally to cell cycle progression and more 
specifically to molecules involved in regulating DNA replication checkpoints. 

BACKGROUND OF THE INVENTION 

[0004] DNA damage, if not properly repaired, results in genomic instability, 
which is highly mutagenic and potentially lethal to the cell During the cell cycle, the 
integrity of chromosomal DNA is ensured by surveillance systems that sense 
damaged DNA and arrest the cell cycle, giving the cell more time for repair processes. 
The surveillance systems that inhibit the entry into mitosis in the presence of damaged 
(or unreplicated) DNA consist of checkpoint signaling pathways that ultimately 
regulate the Cdc2-cyclin B complex, also known as maturation or M-phase promoting 
factor (MPF). In various organisms, these pathways contain the phosphoinositide 
kinase (PIK) relatives ATM, ATR, Rad3, and Mecl, and the effector kinases Chk2, 
Cdsl, Rad53, and Chkl . The phosphatase Cdc25 and kinases Weel and Mytl directly 
control the inhibitory phosphorylation of Cdc2, which is maintained when upstream 
checkpoint regulators detect damaged or unreplicated DNA in the cell. 

[0005] Although many types of DNA damage can activate a checkpoint, it is 
largely unknown what DNA structure is the ultimate damage signal. The 
radiomimetic agent methylmethane sulfonate (MMS), which elicits a strong Mecl- 
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dependent checkpoint response in budding yeast, creates adducts and apurinic sites, 
which become single-and double-stranded breaks. Likewise, both UV and ionizing 
irradiation induce a DNA damage checkpoint in various organisms. UV causes the 
formation of pyrimidine dimers, which are repaired predominantly by nucleotide 
excision, whereas ionizing irradiation generates mainly double-stranded breaks. 
Since DNA damage can arise by multiple mechanisms and the processing of primary 
DNA lesions can be complex in eukaryotic cells, it has been difficult to characterize 
at a molecular level the DNA structures that elicit checkpoint responses. 

[0006] In addition to DNA damage, DNA replication blocks can also trigger a cell 
cycle arrest. The signal(s) that elicits this arrest is unknown, but possibilities include 
replication intermediates, such as single-stranded DNA, which might accumulate 
when DNA synthesis is stalled. In budding yeast, the length of a cell cycle arrest in 
response to DNA damage correlates with the amount of the single-stranded DNA that 
is generated by endonucleolytic processing. Furthermore, the addition of single- 
stranded Ml 3 DNA XoXenopus egg extracts results in a strong cell cycle delay. 

[0007] Studies of the yeasts have revealed that DNA damage and replication 
blocks may be recognized by a group of proteins, each of which is required for a 
normal checkpoint. In the fission yeast Schizosaccharomyces pombe, for example, 
this group of gene products includes Radl, Rad3, Rad9, Radl7, Rad26, Husl, Cut5, 
and Crb2. Rad3 has substantial structural similarity to the human ATM and ATR 
proteins, each of which possesses protein kinase activity. A similar pathway exists in 
the budding yeast Saccharomyces cerevisiae. The biochemical functions of the other 
proteins in this group are poorly understood, but they are currently thought to be 
involved in sensing damaged DNA or stalled replication complexes. Cdsl and Chkl 
are two effector kinases with overlapping functions that receive signals from upstream 
checkpoint sensors. DNA damage and replication blocks activate Cdsl by a 
mechanism that requires these proteins. Chkl, however, is normally activated in 
response to only DNA damage. Both Cdsl and Chkl phosphorylate and inhibit the 
function of Cdc25, the protein phosphatase that dephosphorylates tyrosine-15 of the 
cyclin-dependent kinase Cdc2. Mammalian homologues of many of these checkpoint 
proteins have been isolated and have been shown to possess similar biochemical 
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functions, indicating that many features of these checkpoint pathways have been 
conserved throughout evolution. 

[0008] Xenopus egg extracts have been used to study vertebrate checkpoint 
mechanisms. Immunodepletion of a Xenopus homologue of Chkl (Xchkl) results in 
a substantial but not complete abrogation of the cell cycle delay in Xenopus egg 
extracts in response to replication blocks induced by aphidicolin or to UV-damaged 
DNA. Xchkl is involved in a caffeine-sensitive pathway, but that a caffeine- 
insensitive pathway is also involved in the response to aphidicolin and UV radiation. 
There is therefore a need to identify all key components of checkpoint pathways. 

SUMMARY OF THE INVENTION 

[0009] The present invention provides substantially pure polypeptides 
characterized as (a) phosphorylating Cdc25 or a homologue thereof; (b) having a 
molecular mass of about 58 kD; (c) having about 517 amino acids; (d) having SQ/TQ 
motifs at the amino terminal region; (e) having a carboxyl terminal kinase domain; 
and (f) having an amino terminal forkhead-associated domain. 

[0010] Also provided by the invention is a substantially pure polypeptide 

having an amino acid sequence as set forth in SEQ ID NO:2 or conservative variants 
thereof 

[0011] Further provided by the invention is an isolated polynucleotide 

encoding a polypeptide polypeptides characterized as (a) phosphorylating Cdc25 or a 
homologue thereof; (b) having a molecular mass of about 58 kD; (c) having about 517 
amino acids; (d) having SQ/TQ motifs at the amino terminal region; (e) having a 
carboxyl terminal kinase domain; and (f) having an amino terminal forkhead- 
associated domain. The invention also provides an isolated polynucleotide having the 
sequence set forth in SEQ ID NOT and degenerate variants thereof 

[0012] The invention also provides isolated polynucleotides isolated 

polynucleotide selected from the group consisting of (a) a polynucleotide encoding a 
polypeptide having an amino acid sequence as set forth in SEQ ED NO:2; (b) a 
polynucleotide of (a), wherein all T's are U; and (c) a polynucleotide complementary 
to (a) or (b). Also provided are isolated polynucleotide fragments having at least 15 



4 



base pairs and that hybridizes to a polynucleotide encoding a polypeptide having the 
amino acid sequence set forth in SEQ ID NO:2. 

[0013] Also provided by the present invention are isolated oligonucleotides 

having GACTCCGAGAACCAATTGC (SEQ ID N03), . 
GCAATTGGTTCTCGGAGTC (SEQ ID NO:4), and 
GCGGCACGTTCTCGTGCCGC (SEQ ID NO:5). 

[0014] The invention further provides an antibody that binds to a Cdsl 

polypeptide or immunoreactive fragments thereof. Polyclonal and monoclonal 
antibodies are contemplated. 

[0015] Also provided are methods for increasing mitotic delay in a vertebrate 

cell. The method includes providing to the cell one or more oligonucleotides that 
from double-stranded DNA. 

[0016] Yet another embodiment of the invention provides a method for 

identifying a reagent that modulates phosphorylation of a polypeptide. The method 
includes incubating a reagent with the polypeptide, and one or more oligonucleotides 
that form double-stranded DNA, under conditions that allow the components to 
interact with each other; and comparing the phosphorylation of the polypeptide to 
phosphorylation of a polypeptide not incubated with the reagent, wherein a difference 
in phosphorylation is indicative of a reagent that modulates phosphorylation of the 
polypeptide. The modulation can be an increase in phosphorylation or a decrease in 
phosphorylation. 

[0017] Yet a further embodiment of the invention provides a method for 

modulating cell cycle progression in a cell. The method includes providing to the cell 
a compound that affects the activity or expression of a Cdsl polypeptide, thereby 
modulating cell cycle progression. 

[0018] Another embodiment of the invention provides a method of treating a 

subject having a cellular disorder associated with increased cell cycle progression 
compared to a subject not having the cellular disorder. The method includes 
administering to a subject having the disorder a therapeutically effective amount of a 
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reagent that increases a Cdsl polypeptide activity, thereby treating the cellular 
disorder. 

[0019} Yet another embodiment of the invention provides a method of 

diagnosing a Cdsl- associated disorder in a subject. The method includes 
determining the level of Cdsl mRNA or protein expression in the subject, wherein a 
low level of Cdsl in the subject compared to the level in a subject not having a Cds- 
associated disorder is indicative of a Cds-associated disorder. 

[0020] Also provided is a kit for activating a Cdsl polypeptide. The kit 

includes double-stranded DN A and a container for the DNA. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0021] Figure 1 shows the nucleotide sequence of Xenopus laevis protein 

kinase Cdsl (Cdsl) mRNA, complete cds (Accession Number AF174295; SEQ ID 
NOT). 

[0022] Figure 2 shows the amino acid sequences of Xenopus laevis protein 

kinase Cdsl (Accession Number AAF75829; SEQ ED NO:2). 

[0023] Figure 3 is a flow diagram illustrating a computer system, data 

retrieving device and display 

[0024] Figure 4 is a flow diagram illustrating one embodiment of process 200 

for comparing a new nucleotide or protein sequence with a database of sequences in 
order to determine the homology levels between the new sequence and the sequences 
in the database. 

[0025] Figure 5 is a flow diagram illustrating one embodiment of a process 

250 in a computer for determining whether two sequences are homologous. 

[0026] Figure 6 is a flow diagram illustrating one embodiment of a process 

300 for comparing features in polynucleotide and polypeptide sequences. 



DETAILED DESCRIPTION OF THE INVENTION 



[0027] The present invention provides polypeptides that are involved in 

regulating the progression of the cell cycle. The present invention is based on the 
discovery that a novel polypeptide, Cdsl, plays a significant role in the DNA damage 
checkpoint. The Cdsl polypeptide is activated by double-stranded DNA and is 
phosphorylated in response to the presence of double-stranded DNA. Once activated, 
the polypeptide can phosphorylate Cdc25 polypeptides. The phosphorylation of the 
polypeptide and the following phosphorylation of Cdc25 polypeptides ensures that the 
timing of the cell cycle progression is appropriate. 

[0028] When a cell encounters a problem such as damaged DNA or a block to 

replication, it can call upon a variety of mechanisms to fix the problem. But these 
mechanisms can take time, and it is often crucial that the cell does not continue 
through the cell cycle until the problem is fixed. That is where cell cycle checkpoints 
come in. The checkpoints recognize the problem and delay cell cycle progression by 
inhibiting the basic cell cycle machinery until the problem is fixed (Hartwell and 
Weinert, (1989) Science 246, 629-634; Elledge, (1996) Science 274 y 1664-1672; 
Rhind and Russell, (1998) Curr Opin. Cell Biol 10, 749-58). These checkpoints can 
also regulate transcription and may directly regulate repair machinery. Many, if not 
all, of the major cell cycle transitions are regulated by one or another checkpoint. 
Checkpoints include DNA damage and replication checkpoints, which have served as 
the prototypic checkpoint pathways. These checkpoints are triggered by various 
forms of DNA damage, and various treatments that block replication, respectively. 

[0029] One can divide checkpoints into three parts: a sensor, a transduction 

pathway and a target. The transduction pathways for the DNA damage and 
replication checkpoints are composed of a shared group of conserved proteins that 
may also serve as the sensors. The pathways have been recently reviewed (Elledge, 
(1996) supra; Longhese et aL, (1998) EMBOJ. 17, 5525-5528; Rhind and Russell, 

1 998, supra; Dasika et til, (1 999) Oncogene 18, 7883-7899). Of the proteins known 
to be involved, one is a large protein kinase of the DNA-PK family, known as Rad3 in 
the fission yeast Schizosaccharomyces pombe, Mecl in budding yeast Saccharomyces 
cerevisiat, MEI-41 in the fruit fly Drosophila melanogastex, and X-ATM in the frog 
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Xenopus laevis (Zakian, 1995). Two homologues, ATM and ATR, have been 
identified in humans and mice (Westphal, 1997). For convenience, these homologues 
are referred to generically as ATMs but the specific name is used when referring to a 
specific organism. By analogy with DNA-PK, a kinase that is activated by binding to 
DNA ends, it is proposed that ATM acts to recognize the DNA damage or stalled 
replication forks and initiate the checkpoint signal (Hartley et al y (1995) Cell 82, 849- 
856; Bentley et al., (\996) EMBO J. lf>, 6641-6651). The other members of this 
pathway could serve as regulatory subunits of a complex that has ATM as its core 
(Longhese et al. 7 (1998) supra). Although it is appealing, there is little direct 
evidence for this model. However, the fact that the checkpoints share many upstream 
proteins suggests that they may recognize a similar or overlapping set of DNA 
structures. Downstream of ATM in each species are homologues of the Chkl and 
Cdsl protein kinases (see for example, Weinert et al. y (1994) Genes Dev. 8, 652-665; 
Kumagai et al, (1998). J. Cell Biol 142, 1559-1569; Guo and Dunphy, (2000)Mol 
Biol. Cell 11, 1535-1546, incorporated herein by reference; Liu et al y 2000b). 

[0030] The founding member of the Cdsl family is budding yeast Rad53 

(Allen et al, (\994) Genes Dev. 8, 2401-2415). Like Chkl, Rad53 is widely 
conserved (Murakami and Okayama, (\99S) Nature 374 , 817-819; Matsuoka et all, 
(1998) Science 282, 1 893-1897; Blasina et al y (1999) Curr. Biol % 1-10; Brown et 
al, (\999) Proc.Natl. Acad Sci. USA 96, 3745-3750; Chaturvedi etal y (1999) 
Oncogene 18, 4047-4054; Tominaga et al y (1 999) J. Biol Chem. 214, 3 1 463-3 1467; 
Guo and Dunphy, (2000) supra). These homologues are generally called Cdsl, but 
the specific name is used when referring to a specific organism. The Cdsl 
homologues are recognizable by a similar kinase domain and an N-terminal forkhead 
associated (FHA) domain (Blasina et al y (1999) supra). 

[0031] Accordingly, the present invention provides a substantially polypeptide 

characterized as (a) being phosphorylated in the presence of damaged double-stranded 
DNA; (b) phosphorylating Cdc25; (c) being activated by poly(dT) 4 o; (d) having a 
molecular mass of about 58 kD; (e) having about 517 amino acids; (f) having SQ/TQ 
motifs at the amino terminal region; (g) having a carboxyl terminal kinase domain; 
and (h) having an amino terminal forkhead-associated domain. 



8 



[0032] XCdsl becomes highly phosphorylated when Ml 3 DNA or linearized, 

double-stranded plasmids are added to Xenopus egg extracts. Likewise, Cdsl 
undergoes checkpoint-associated phosphorylation in response to poly(dT) 4 o and 
various double- stranded oligonucleotides (see Examples section). When Ml 3 and 
poly(dT) 40 are added to the extracts in a single-stranded form, both of these templates 
are replicated very efficiently by the DNA synthetic machinery present in egg extracts 
and Cdsl is phosphorylated. Indeed, inhibition of M13 DNA replication by treatment 
with actinomycin D abolishes the modification of Cds 1 . Furthermore, various single- 
stranded oligonucleotides that are not replicated by egg extracts do not bring about the 
phosphorylation of Cdsl . 

[0033] Xcdsl undergoes checkpoint associated phosphorylation in the 

presence of double stranded oligonucleotides. For example, when isolated 
nucleotides having the sequence GACTCCGAGAACCAATTGC (SEQ ID NO:3) and 
GCAATTGGTTCTCGGAGTC (SEQ ID NO:4) are added to egg extracts containing 
Cdsl, a double stranded DNA molecule is formed and Cdsl undergoes 
phosphorylation. Similarly, when an isolated oligonucleotide having the sequence 
GCGGCACG TTCTCGTGCCGC (SEQ ID NO. 5) is added to egg extracts containing 
Cdsl, phosphorylation of Cdsl ensues. The oligonucleotide sequence is capable of 
forming a hairpin structure by annealing (see underlined regions) and thus acts as 
double-stranded DNA. 

[0034] Cdsl phosphorylates Cdc25, the protein phosphatase that 

dephosphorylates tyrosine 15 of Cdc2 (Zeng et aL y (1998) Nature, 395, 507-510). 
When phosphorylated, Cdc25 activity is inhibited. Xcdsl possesses similar 
biochemical functions. Phosphorylation of Xenopus Cdc25 occurs on Serine residue 
287 (Ser-287) in the 14-3-3 binding site of Xenopus Cdc25. 

[0035] Xcdsl is activated by poly(dT)40. When Cdsl is phosphorylated in 

extracts containing poly (dT) 4 o ? hyperphosphorylated Cdsl shows a five- to six-fold 
increase over background in its kinase activity toward GST-Cdc25[254 316J-WT. 

[0036] Cdsl polypeptides contain a relatively large number of SQ/TQ motifs 

at the amino terminal region. The serines (S) and threonines (T), each adjacent to a 
glutamine (Q) residue in this type of motif, are potential substrates for kinases such as 
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ATM, ATR, and DNX-PK that are involved in checkpoint pathways (Kim et aL y 
(1999) 1 Biol Chem r 274, 37538-37543). 

[0037] Cdsl is further characterized as having a carboxyl terminal kinase 

domain. This region is the most conserved region of Xcdsl when compared to Chk2, 
the human homologue of Cdsl . 

[0038] Xcdsl is also characterized as having an amino terminal forkhead- 

assoicated domain (FHA domain). FHA domains, originally recognized in the 
forkhead transcription factor, are believed to act as protein-protein interaction 
domains, and in some instances bind specifically to phosphorylated partners 
(Hofmann and Bucher, (1 995) Trends Biochem. Sci. 20, 347-349; Sun et aL y (1 998) 
Science2$± y 272-274; Durocher etal y (\999)MoI. Cell 4, 387-394. 

[0039] An exemplary Xcdsl polypeptide is set forth in SEQ ID NO:2. The 

invention includes conservative variants thereof The terms "conservative variation" 
and "substantially similar" as used herein denotes the replacement of an amino acid 
residue by another, biologically similar residue. Examples of conservative variations 
include the substitution of one hydrophobic residue such as isoleucine, valine, leucine 
or methionine for another, or the substitution of one polar residue for another, such as 
the substitution of arginine for lysine, glutamic acid for aspartic acid, or glutamine for 
asparagine, and the like. The terms "conservative variation" and "substantially 
similar" also include the use of a substituted amino acid in place of an unsubstituted 
parent amino acid provided that antibodies raised to the substituted polypeptide also 
immunoreact with the unsubstituted polypeptide. 

[0040] In one embodiment of the invention, the polypeptide is identical with 

or homologous to a Xenopus Cdsl polypeptide, such as a mammalian Cdsl 
polypeptide. For instance, the Cdsl polypeptide preferably has an amino acid 
sequence at least 80% homologous to a polypeptide represented by SEQ ID No:2, 
though polypeptides with higher sequence homologies of, for example, 85%, 90% or 
95% are also embraced by the invention. The Xcds polypeptides can comprise a full 
length protein, such as represented in the sequence listing, or it can comprise a 
fragment of, for instance, at least 5, 10, 20, 50, 100, 150 or 200 amino acids in length. 
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[0041] As is well known, genes for a particular polypeptide may exist in 

single or multiple copies within the genome of an individual. Such duplicate genes 
may be identical or may have certain modifications, including nucleotide 
substitutions, additions or deletions, which all still code for polypeptides having 
substantially the same activity. The term "nucleic acid sequence encoding a Cdsl 
polypeptide" may thus refer to one or more genes within a particular individual. 
Moreover, individual organisms may bear different nucleotide sequences, called 
alleles, which code for substantially the same polypeptide. Such allelic differences 
may or may not result in differences in amino acid sequence of the encoded 
polypeptide yet still encode a protein with the same biological activity. 

[0042] "Homology" refers to sequence similarity between two peptides or 

between two nucleic acid molecules. Homology can be determined by comparing a 
position in each sequence which may be aligned for purposes of comparison. When a 
position in the compared sequence is occupied by the same base or amino acid, then 
the molecules are homologous at that position. A degree of homology between 
sequences is a function of the number of matching or homologous positions shared by 
the sequences. Homology can be shown by alignment of sequences, using for 
example the Pretty Plot function of the GCG program. PRETTYPLOT displays 
multiple sequence alignments and calculates a consensus sequence. PRETTYPLOT 
prints and plots sequences with their columns aligned. This utility is used after a 
number of sequences have had gaps added to make them all align. PRETTYPLOTs 
output allows one to look at relationships among several sequencesAn "unrelated" or 
"non-homologous" sequence shares less than 25 percent identity, though preferably 
less than 1 5 percent identity, with one of the Cdsl sequences of the present invention. 

10043] The term "isolated" as also used herein with respect to nucleic acids, 

such as DN A or RNA, refers to molecules separated from other DNAs, or RNAs, 
respectively, that are present in the natural source of the macromolecule. For 
example, an isolated nucleic acid encoding one of the subject Cdsl polypeptides 
preferably includes no more than 10 kilobases (kb) of nucleic acid sequence which 
naturally immediately flanks the Xcdsl gene in genomic DNA, more preferably no 
more than 5 kb of such naturally occurring flanking sequences, and most preferably 
less than 1 .5 kb of such naturally occurring flanking sequence. The term isolated as 
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used herein also refers to a nucleic acid or peptide that is substantially free of cellular 
material, viral material, or culture medium when produced by recombinant DNA 
techniques, or chemical precursors or other chemicals when chemically synthesized. 
Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments which 
are not naturally occurring as fragments and would not be found in the natural state. 

[0044] Exemplary polynucleotides encoding a Cdsl polypeptide is set forth in 

SEQ ID NO: 1 . The term "polynucleotide", "nucleic acid", "nucleic acid sequence", 
or "nucleic aci^ molecule" refers to a polymeric form of nucleotides at least 10 bases 
in length. By "isolated polynucleotide" is meant a polynucleotide that is not 
immediately contiguous with both of the coding sequences with which it is 
immediately contiguous (one on the 5' end and one on the 3' end) in the naturally 
occurring genome of the organism from which it is derived. The term therefore 
includes, for example, a recombinant DNA which is incorporated into a vector; into 
an autonomously replicating plasmid or virus; or into the genomic DNA of a 
prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA or 
genomic DNA) independent of other sequences. It also includes genomic DNA 
which refers to a contiguous sequence of nucleotide that includes one or more protein 
coding regions, introns, upstream and downstream regulatory sequences, i.e., non- 
coding 5'- and 3'- regulatory sequences. Thus, the term "polynucleotide encoding a 
polypeptide" encompasses a polynucleotide which includes coding sequence for the 
polypeptide as well as a polynucleotide which includes additional coding and/or non- 
coding sequence. 

[0045] The invention also provides an isolated polynucleotide having at least 

1 5 continuous base pairs that hybridizes to a polynucleotide selected from the group 
consisting of (a) a polynucleotide encoding a polypeptide having an amino acid 
sequence as set forth in amino acids 1 to 86 or amino acids 46 1 to 5 1 7 of SEQ ID 
NO:2; (b) a polynucleotide of (a), wherein T can be U; (c) a polynucleotide 
complementary to (a) or (b); (d) a polynucleotide having a nucleotide sequence as set 
forth in nucleotides 224 to 481 or nucleotides 1604 to 1770 of SEQ ID NO: 1 ; and 
(e) degenerate variants of (a), (b), (c) or (d). 



[0046] The nucleotides of the invention can be deoxyribonucleotides, 

ribonucleotides in which uracil (U) is present in place of thymine (T), or modified 
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forms of either nucleotide. The nucleotides of the invention can be complementary to 
the deoxynucleotides or to the ribonucleotides. A polynucleotide encoding a Cds 
protein includes "degenerate variants", sequences that are degenerate as a result of the 
genetic code. There are 20 naturally occuring amino acids, most of which are 
specified by more than one codon. Therefore, all degenerate nucleotide sequences are 
included in the invention as long as the amino acid sequence of a polypeptide encoded 
by the nucleotide sequence of SEQ ID NO: 1 is functionally unchanged. 

[0047J A nucleic acid molecule encoding a Cds I protein includes sequences 

encoding functional Cdsl polypeptides as well as functional fragments thereof As 
used herein, the term "functional polypeptide" refers to a polypeptide which possesses 
biological function or activity which is identified through a defined functional assay 
{e.g., Examples Section), and which is associated with a particular biologic, 
morphologic, or phenotypic alteration in the cell, for example an alteration in the 
timing of the cell cycle; The term "functional fragments of Xcdsl protein" refers to 
fragments of a Xcdsl protein that retain a XCdsl activity, e.g., the ability to be 
phophorylated in the presence of double-stranded DNA, being able to phosphorylate 
Cdc25, and the like. Additionally, functional Xcdsl fragments may act as 
competitive inhibitors of Xcdsl binding, for example. Biologically functional 
fragments, for example, can vary in size from a polypeptide fragment as small as an 
epitope capable of binding an antibody molecule to a large polypeptide capable of 
participating in the characteristic induction or programming of biological changes 
within a cell. Nucleotide fragments of the invention have at least 1 5 base pairs, 
preferably 20 base pairs, and hybridize to a polynucleotide encoding a polypeptide as 
set forth in SEQ ID NO:2. 

[0048] Yet another embodiment of the invention provides an isolated 

polynucleotide, wherein the nucleotide is at least 15 base pairs in length which 
hybridizes under moderately to highly stringent conditions to DNA encoding a 
polypeptide as set forth in SEQ ID NO:2. In nucleic acid hybridization reactions, the 
conditions used to achieve a particular level of stringency will vary, depending on the 
nature of the nucleic acids being hybridized. For example, the length, degree of 
complementarity, nucleotide sequence composition {e.g., GC v. AT content), and 
nucleic acid type {e.g., RNA v. DNA) of the hybridizing regions of the nucleic acids 
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can be considered in selecting hybridization conditions. An additional consideration 
is whether one of the nucleic acids is immobilized, for example, on a filter. 

[0049] An example of progressively higher stringency conditions is as 

follows: 2 x SSC/0. 1% SDS at about room temperature (hybridization conditions); 
0.2 x SSC/0. 1% SDS at about room temperature (low stringency conditions); 0.2 x 
SSC/0. 1% SDS at about 42°C (moderately stringent conditions); and 0. 1 x SSC at 
about 68°C (highly stringent conditions). Washing can be carried out using only one 
of these conditions, e.g., high stringency conditions, or each of the conditions can be 
used, e.g., for 10-15 minutes each, in the order listed above, repeating any or all of the 
steps listed. However, as mentioned above, optimal conditions will vary, depending . 
on the particular hybridization reaction involved, and can be determined empirically. 

[0050] The present invention also provides a probe/primer comprising a 

substantially purified oligonucleotide, wherein the oligonucleotide comprises a region 
of nucleotide sequence which hybridizes under stringent conditions to at least 10 
consecutive nucleotides of sense or antisense sequence of SEQ. ID No. 1, or naturally 
occurring mutants thereof. In preferred embodiments, the probe/primer further 
comprises a label group attached thereto and able to be detected, e.g. the label group 
is selected from a group consisting of radioisotopes, fluorescent compounds, 
enzymes, and enzyme co-factors. Such probes can be used as a partof a diagnostic 
test kit for identifying transformed cells, such as for measuring a level of a Cdsl- 
encoding nucleic acid in a sample of cells isolated from a patient; e.g. for measuring 
the mRNA level in a cell. 

[0051] Another aspect of the invention includes an isolated oligonucleotide 

having the sequence GACTCCGAGAACCAATTGC (SEQ ID NO 3). A further 
embodiment of the invention includes an isolated oligonucleotide having the sequence 
GCAATTGGTTCTCGGAGTC (SEQ ID NO:4). Yet another embodiment of the 
invention includes an isolated oligonucleotide having the sequence 
GCGGCACGTTCTCGTGCCGC (SEQ ID NO:5). 

r 

[0052] Antibodies of the invention may bind to Cdsl polypeptides provided 

by the invention to prevent normal interactions of Cdsl proteins. Binding of 
antibodies to Cdsl proteins can interfere with, for example, cell cycle progression. 
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Binding of antibodies can interfere with Xcdsl protein phosphorylation of a cdc25 
protein, and the like. 

[0053] The antibodies of the invention can be used in any subject in which it 

is desirable to administer in vitro or in vivo immunodiagnosis or immunotherapy. The 
antibodies of the invention are suited for use, for example, in immunoassays in which 
they can be utilized in liquid phase or bound to a solid phase carrier. In addition, the 
antibodies in these immunoassays can be detectably labeled in various ways. 
Examples of types of immunoassays which can utilize antibodies of the invention are 
competitive and non-competitive immunoassays in either a direct or indirect format 
Examples of such immunoassays are the radioimmunoassay (RI A) and the sandwich 
(immunometric) assay. Detection of the antigens using the antibodies of the invention 
can be done utilizing immunoassays which are run in either the forward, reverse, or 
simultaneous modes, including immunohistochemical assays on physiological 
samples. Those of skill in the art will know, or can readily discern, other 
immunoassay formats without undue experimentation. 

[0054] The term "antibody" as used in this invention includes intact molecules 

as well as fragments thereof, such as Fab, F(ab')2, and Fv which are capable of 
binding to an epitopic determinant present in an invention polypeptide. Such antibody 

fragments retain some ability to selectively bind with its antigen or receptor. 

j 

[0055] Methods of making these fragments are known in the art. (See for 

example, Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor 
Laboratory, New York (1988), incorporated herein by reference). Monoclonal 
antibodies are made from antigen containing fragments of the protein by methods well 
known to those skilled in the art (Kohler & Milstein, Nature 256:495 (1975), Coligan 
et aL ? sections 2.5. 1-2.6.7; and Harlow et al, Antibodies: A Laboratory Manual, page 
726 (Cold Spring Harbor Pub. 1988), which are hereby incorporated by reference. 
Briefly, monoclonal antibodies can be obtained by injecting mice with a composition 
comprising an antigen/ligand, verifying the presence of antibody production by 
analyzing a serum sample, removing the spleen to obtain B lymphocytes, fusing the B 
lymphocytes with myeloma cells to produce hybridomas, cloning the hybridomas, 
selecting positive clones that produce antibodies to the antigen, and isolating the 
antibodies from the hybridoma cultures. Monoclonal antibodies can be isolated and 
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purified from hybridoma cultures by a variety of well-established techniques. Such 
isolation techniques include affinity chromatography with Protein-A Sepharose, size- 
exclusion chromatography, and ion-exchange chromatography. See, e.g., Coligan et 
aL, sections 2.7.1-2.7.12 and sections 2.9.1-2.9.3; Barnes et al. y "Purification of 
Immunoglobulin G (IgG)" in Methods In Molecular Biology, Vol. 10, pages 79-104 
(Humana Press 1992). 

[0056] Antibodies that bind to an invention polypeptide can be prepared using 

an intact polypeptide or fragments containing small peptides of interest as the 
immunizing antigen. For example, it may be desirable to produce antibodies that 
specifically bind to the amino- or carboxyl-terminal domains of an invention 
polypeptide. For the preparation of polyclonal antibodies, the polypeptide or peptide 
used to immunize an animal is derived from translated cDNA or chemically 
synthesized and can be conjugated to a carrier protein, if desired. Commonly used 
carrier proteins which may be chemically coupled to the immunizing peptide include 
keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin (BSA), 
tetanus toxoid, and the like. 

f 0057] Invention polyclonal or monoclonal antibodies can be further purified, 

for example, by binding to and elution from a matrix to which the polypeptide or a 
peptide to which the antibodies were raised is bound. Those of skill in the art will 
know of various techniques common in the immunology arts for purification and/or 
concentration of polyclonal antibodies, as well as monoclonal antibodies (See, for 
example, Coligan, et al. y Unit 9, Current Protocols in Immunology, Wiley 
Interscience, 1994, incorporated herein by reference). 

[0058] The antibodies of the invention can be bound to many different carriers 

and used to detect the presence of an antigen comprising the polypeptides of the 
invention. Examples of well-known carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified 
celluloses, polyacrylamides, agaroses and magnetite. The nature of the carrier can be 
either soluble or insoluble for purposes of the invention Those skilled in the art will 
know of other suitable carriers for binding antibodies, or will be able to ascertain 
such, using routine experimentation. 
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[0059] There are many different labels and methods of labeling known to 

those of ordinary skill in the art. Examples of the types of labels which can be used in 
the present invention include enzymes, radioisotopes, fluorescent compounds, 
colloidal metals, chemiluminescent compounds, phosphorescent compounds, and 
bioluminescent compounds. Those of ordinary skill in the art will know of other 
suitable labels for binding to the antibody, or will be able to ascertain such, using 
routine experimentation. 

[0060] Another technique that may also result in greater sensitivity consists of 

coupling the antibodies to low molecular weight haptens. These haptens can then be 
specifically detected by means of a second reaction. For example, it is common to 
use such haptens as biotin, which reacts with avidin, or dinitrophenyl, puridoxal, and 
fluorescein, which can react with specific antihapten antibodies. 

[0061 ] In using the monoclonal and polyclonal antibodies of the invention for 

the in vivo detection of antigen, e.g., a Cdsl protein, the detectably labeled antibody 
is given a dose which is diagnostically effective. The term "diagnostically effective" 
means that the amount of detectably labeled antibody is administered in sufficient 
quantity to enable detection of the site having the antigen comprising a polypeptide of 
the invention for which the antibodies are specific. 

[0062] The concentration of detectably labeled antibody which is administered 

should be sufficient such that the binding to those cells having the polypeptide is 
detectable compared to the background. Further, it is desirable that the detectably 
labeled antibody be rapidly cleared from the circulatory system in order to give the 
best target-to-background signal ratio. 

[0063] As a rule, the dosage of detectably labeled antibody for in vivo 

treatment or diagnosis will vary depending on such factors as age, sex, and extent of 
disease of the individual. Such dosages may vary, for example, depending on whether 
multiple injections are given, antigenic burden, and other factors known to those of 
skill in the art. 

[0064] A polynucleotide agent can be contained in a vector, which can 

facilitate manipulation of the polynucleotide, including introduction of the 
polynucleotide into a target cell. The vector can be a cloning vector, which is useful 
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for maintaining the polynucleotide, or can be an expression vector, which contains, in 
addition to the polynucleotide, regulatory elements useful for expressing the 
polynucleotide and, where the polynucleotide encodes a peptide, for expressing the 
encoded peptide in a particular cell. An expression vector can contain the expression 
elements necessary to achieve, for example, sustained transcription of the encoding 
polynucleotide, or the regulatory elements can be operatively linked to the 
polynucleotide prior to its being cloned into the vector. 

[0065] An expression vector (or the polynucleotide) generally contains or 

encodes a promoter sequence, which can provide constitutive or, if desired, inducible 
or tissue specific or developmental stage specific expression of the encoding 
polynucleotide, a poly- A recognition sequence, and a ribosome recognition site or 
internal ribosome entry site, or other regulatory elements such as an enhancer, which 
can be tissue specific. The vector also can contain elements required for replication in 
a prokaryotic or eukaryotic host system or both, as desired. Such vectors, which 
include plasmid vectors and viral vectors such as bacteriophage, baculovirus, 
retrovirus, lentivirus, adenovirus, vaccinia virus, semliki forest virus and adeno- 
associated virus vectors, are well known and can be purchased from a commercial 
source (Promega, Madison WI; Stratagene, La Jolla CA; GEBCO/BRL, Gaithersburg 
MD) or can be constructed by one skilled in the art (see, for example, Meth EnzymoL, 
Vol 185, Goeddel, ed. (Academic Press, Inc., 1990); Jolly, Cane. Gene Ther^ 1:51-64, 
1994; Flotte, l_Bioenerg. Biomemb^ 25:37-42, 1993; Kirshenbaum et al y J. Clin. 
Invest, 92:381-387, 1993; each of which is incorporated herein by reference). 

[0066] A polynucleotide useful in a method of the invention also can be 

operatively linked to tissue specific regulatory element. For example, a neuron 
specific regulatory element can be employed such that expression of an encoded 
peptide agent is restricted to neurons in an individual, or to neurons in a mixed 
population of cells in culture, for example, an organ culture. For example, neuronal 
promoters such as the myelin basic protein promoter and other neuronal-specific 
promotes known to those of skill in the art may be used. Muscle-regulatory elements 
including, for example, the muscle creatine kinase promoter (Sternberg et al, Mol 
Cell Biol 8:2896-2909, 1988, which is incorporated herein by reference) and the 
myosin light chain enhancer/promoter (Donoghue et al y Proa Natl Acad. Sci. f USA 
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88:5847-5851, 1991, which is incorporated herein by reference) are well known in the 
art. A variety of other promoters have been identified which are suitable for up 
regulating expression in cardiac tissue. Included, for example, are the cardiac 
1-myosin heavy chain (AMHC) promoter and the cardiac I-actin promoter. Other 
examples of tissue-specific regulatory elements include, tissue-specific promoters, 
pancreatic (insulin or elastase), and actin promoter in smooth muscle cells. Through 
the use of promoters, such as milk-specific promoters, recombinant retroviruses may 
be isolated directly from the biological fluid of the progeny. 

[0067] A Cdsl polynucleotide of the invention can be inserted into a vector, 

which can be a cloning vector or a recombinant expression vector. The term 
"expression vector" refers to a plasmid, virus or other vehicle known in the art that 
has been manipulated by insertion or incorporation of a polynucleotide, particularly, 
with respect to the present invention, a polynucleotide encoding all or a peptide 
portion of a CDsl protein. Such expression vectors contain a promoter sequence, 
which facilitates the efficient transcription of the inserted genetic sequence of the 
host. The expression vector generally contains an origin of replication, a promoter, as 
well as specific genes, which allow phenotypic selection of the transformed cells. 
Vectors suitable for use in the present invention include, but are not limited to, the 
T7-based expression vector for expression in bacteria (Rosenberg, et al, Gene 
56:125, 1987), the pMSXND expression vector for expression in mammalian cells 
(Lee and Nathans, /. Biol Chem. 263:3521, 1988) and baculovirus-derived vectors for 
expression in insect cells. The DNA segment can be present in the vector operably 
linked to regulatory elements, for example, a promoter, which can be a T7 promoter, 
metallothionein I promoter, polyhedrin promoter, or other promoter as desired, 
particularly tissue specific promoters or inducible promoters. 

[0068] Viral expression vectors can be particularly useful for introducing a 

polynucleotide useful in a method of the invention into a cell, particularly a cell in a 
subject. Viral vectors provide the advantage that they can infect host cells with 
relatively high efficiency and can infect specific cell types. For example, a 
polynucleotide encoding a Cdsl protein or functional peptide portion thereof can be 
cloned into a baculovirus vector, which then can be used to infect an insect host cell, 
thereby providing a means to produce large amounts of the encoded protein or peptide 
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portion. The viral vector also can be derived from a virus that infects cells of an 
organism of interest, for example, vertebrate host cells such as mammalian, avian or 
piscine host cells. Viral vectors can be particularly useful for introducing a 
polynucleotide useful in performing a method of the invention into a target cell. Viral 
vectors have been developed for use in particular host systems, particularly 
mammalian systems and include, for example, retroviral vectors, other lentivirus 
vectors such as those based on the human immunodeficiency virus (HIV), adenovirus 
vectors, adeno-associated virus vectors, herpes virus vectors, vaccinia virus vectors, 
and the like (see Miller and Rosman, BioTechniques 7:980-990, 1992; Anderson et 
al y Nature 392:25-30 Supply 1998; Verma and Somia, Nature 389:239-242, 1997; 
Wilson, New Engl J. Med, 334:1 185-1 187 (1996), each of which is incorporated 
herein by reference). 

[0069] When retroviruses, for example, are used for gene transfer, replication 

competent retroviruses theoretically can develop due to recombination of retroviral 
vector and viral gene sequences in the packaging cell line utilized to produce the 
retroviral vector. Packaging cell lines in which the production of replication 
competent virus by recombination has been reduced or eliminated can be used to 
minimize the likelihood that a replication competent retrovirus will be produced. All 
retroviral vector supernatants used to infect cells are screened for replication 
competent virus by standard assays such as PCR and reverse transcriptase assays. 
Retroviral vectors allow for integration of a heterologous gene into a host cell 
genome, which allows for the gene to be passed to daughter cells following cell 
division. 

[0070] A polynucleotide, which can be contained in a vector, can be 

introduced into a cell by any of a variety of methods known in the art (Sambrook et 
al 7 Molecular Cloning: A laboratory manual (Cold Spring Harbor Laboratory Press 
1989); Ausubel et aL, Current Protocols in Molecular Biology . John Wiley and Sons, 
Baltimore, MP (1987, and supplements through 1995), each of which is incorporated 
herein by reference). Such methods include, for example, transfection, lipofection, 
microinjection, electroporation and, with viral vectors, infection; and can include the 
use of liposomes, microemulsions or the like, which can facilitate introduction of the 
polynucleotide into the cell and can protect the polynucleotide from degradation prior 



20 



to its introduction into the cell The selection of a particular method will depend, for 
example, on the cell into which the polynucleotide is to be introduced, as well as 
whether the cell is isolated in culture, or is in a tissue or organ in culture or in situ, 

[0071] Introduction of a polynucleotide into a cell by infection with a viral 

vector is particularly advantageous in that it can efficiently introduce the nucleic acid 
molecule into a cell ex vivo or in vivo (see, for example, U.S. Patent No. 5,399,346, 
which is incorporated herein by reference). Moreover, viruses are very specialized 
and can be selected as vectors based on an ability to infect and propagate in one or a 
few specific cell types. Thus, their natural specificity can be used to target the nucleic 
acid molecule contained in the vector to specific cell types. As such, a vector based 
on an HIV can be used to infect T cells, a vector based on an adenovirus can be used, 
for example, to infect respiratory epithelial cells, a vector based on a herpesvirus can 
be used to infect neuronal cells, and the like. Other vectors, such as adeno-associated 
viruses can have greater host cell range and, therefore, can be used to infect various 
cell types, although viral or non-viral vectors also can be modified with specific 
receptors or ligands to alter target specificity through receptor mediated events. 

[0072] A polynucleotide sequence encoding a Cdsl protein can be expressed 

in either prokaryotes or eukaryotes. Hosts can include microbial, yeast, insect and 
mammalian organisms. Methods of expressing polynucleotides having eukaryotic or 
viral sequences in prokaryotes are well known in the art, as are biologically functional 
viral and plasmid DNA vectors capable of expression and replication in a host. 
Methods for constructing an expression vector containing a polynucleotide of the 
invention are well known, as are factors to be considered in selecting transcriptional 
or translational control signals, including, for example, whether the polynucleotide is 
to be expressed preferentially in a particular cell type or under particular conditions 
(see, for example, Sambrook et al. y supra, 1989). 

[0073] A variety of host cell/expression vector systems can be utilized to 

express a Cdsl polypeptide coding sequence, including, but not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage DNA, 
plasmid DNA or cosmid DNA expression vectors; yeast cells transformed with 
recombinant yeast expression vectors, plant cell systems infected with recombinant 
virus expression vectors such as a cauliflower mosaic virus or tobacco mosaic virus, 
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or transformed with recombinant plasmid expression vector such as a Ti plasmid; 
insect cells infected with recombinant virus expression vectors such as a baculovirus; 
animal cell systems infected with recombinant virus expression vectors such as a 
retrovirus, adenovirus or vaccinia virus vector; and transformed animal cell systems 
genetically engineered for stable expression. Where the expressed Cdsl protein is 
post-translationally modified, for example, by glycosylation, it can be particularly 
advantageous to select a host cell/expression vector system that can effect the desired 
modification, for example, a mammalian host cell/expression vector system. 

[0074] Depending on the host cell/vector system utilized, any of a number of 

suitable transcription and translation elements, including constitutive and inducible 
promoters, transcription enhancer elements, transcription terminators, and the like can 
be used in the expression vector (Bitter ei al. f Meth. EnzymoL 153:516-544, 1987). 
For example, when cloning in bacterial systems, inducible promoters such as pL of 
bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like can be used. 
When cloning in mammalian cell systems, promoters derived from the genome of 
mammalian cells, for example, a human or mouse metallothionein promoter, or from 
mammalian viruses, for example, a retrovirus long terminal repeat, an adenovirus late 
promoter or a vaccinia virus 7.5K promoter, can be used. Promoters produced by 
recombinant DNA or synthetic techniques can also be used to provide for 
transcription of the inserted GDF receptors coding sequence. 

[0075] In yeast cells, a number of vectors containing constitutive or inducible 

promoters can be used (see Ausubel et aL 7 supra, 1987, see chapter 13; Grant et ai y 
Meth. EnzymoL 153.516-544, 1987; Glover, DNA Cloning Vol. II (1RL Press, 1986), 
see chapter 3; Bitter, Meth. Enzymol. 152:673-684, 1987; see, also, T he Molecular 
Biology of the Yeast Saccharomyces (Eds., Strathern et aL y Cold Spring Harbor 
Laboratory Press, 1982), Vols. I and II). A constitutive yeast promoter such as ADH 
or LEU2 or an inducible promoter such as GAL can be used (Rothstein, DNA 
Cloning Vol. II (supra, 1986), chapter 3). Alternatively, vectors can be used which 
promote integration of foreign DNA sequences into the yeast chromosome. 

[0076] Eukaryotic systems, particularly mammalian expression systems, allow 

for proper post-translational modifications of expressed mammalian proteins. 
Eukaryotic cells which possess the cellular machinery for proper processing of the 
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primary transcript, glycosylation, phosphorylation, and advantageously, plasma 
membrane insertion of the gene product can be used as host cells for the expression of 
a Cdsl protein, or functional peptide portion thereof. 

[0077] Mammalian cell systems which utilize recombinant viruses or viral 

elements to direct expression can be engineered. For example, when using adenovirus 
expression vectors, the Cdsl polypeptide coding sequence can be ligated to an 
adenovirus transcription/translation control complex, e.g., the late promoter and 
tripartite leader sequence. Alternatively, the vaccinia virus 7.5K promoter can be 
used (Mackett et al. Proc. Nail. Acad. ScL, USA 79:7415-7419, 1982; Mackett et al, 
J. Virol 49:857-864, 1984; Panicali et al, Proa Natl Acad ScL, USA 79:4927-4931, 
1982). Particularly useful are bovine papilloma virus vectors, which can replicate as 
extrachromosomal elements (Sarver et al, Mol Cell Biol 1 :486, 1981). Shortly after 
entry of this DNA into mouse cells, the plasmid replicates to about 100 to 200 copies 
per cell. Transcription of the inserted cDNA does not require integration of the 
plasmid into the host cell chromosome, thereby yielding a high level of expression. 
These vectors can be used for stable expression by including a selectable marker in 
the plasmid, such as, for example, the neo gene. Alternatively, the retroviral genome 
can be modified for use as a vector capable of introducing and directing the 
expression of the Cdsl protein gene in host cells (Cone and Mulligan, Proc. Natl 
Acad. Set, USA 81:6349-6353, 1984). High level expression can also be achieved 
using inducible promoters, including, but not limited to, the metallothionein IIA 
promoter and heat shock promoters. 

[0078] For long term, high yield production of recombinant proteins, stable 

expression is preferred. Rather than using expression vectors which contain viral 
origins of replication, host cells can be transformed with Cdsl protein cDNA 
controlled by appropriate expression control elements such as promoter, enhancer, 
sequences, transcription terminators, and polyadenylation sites, and a selectable 
marker. The selectable marker in the recombinant plasmid can confer resistance to 
the selection, and allows cells to stably integrate the plasmid into their chromosomes 
and grow to form foci, which, in turn can be cloned and expanded into cell lines. For 
example, following the introduction of foreign DNA, engineered cells can be allowed 
to grow for 1 to 2 days in an enriched media, and then are switched to a selective 
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media. A number of selection systems can be used, including, but not limited to, the 
herpes simplex virus thymidine kinase (Wigler et al, Cell 1 1 :223, 1977), 
hypoxanthine-guanine phosphoribosyltransferase (Szybalska and Szybalski, Proc, 
Nail Acad Set, USA 48:2026, 1982), and adenine phosphoribosyltransferase (Lowy, 
et al, Cell 22:817, 1980) genes can be employed in tk~, hgprt" or aprt" cells 
respectively. Also, antimetabolite resistance can be used as the basis of selection for 
dhfr, which confers resistance to methotrexate (Wigler, et al, Proc. Natl Acad, Sci. 
USA 77.3567, 1980; O'Hare et al, Proc. Natl Acad. Scl, USA 78: 1527, 1981); gpt, 
which confers resistance to mycophenolic acid (Mulligan and Berg, Proc. Natl Acad. 
Sci., USA 78: 2072, 1981); neo, which confers resistance to the aminoglycoside G-418 
(Colberre-Garapin et al, J. Mol BioL 150:1, 1981); and hygro, which confers 
resistance to hygromycin (Santerre et al, Gene 30:147, 1984) genes. Additional 
selectable genes, including trpB, which allows cells to utilize indole in place of 
tryptophan; hisD, which allows cells to utilize histinol in place of histidine (Hartman 
and Mulligan, Proc. Natl. Acad: Sci., USA 85:8047, 1988); and ODC (ornithine 
decarboxylase) which confers resistance to the ornithine decarboxylase inhibitor, 
2-(difluoromethyl)-DL~ornithine, DFMO (McConlogue, Curr Comm. Mol Biol. 
(Cold Spring Harbor Laboratory Press, 1987), also have been described. 

[0079] When the host is a eukaryote, such methods of transfection of DNA as 

calcium phosphate co-precipitates, conventional mechanical procedures such as 
microinjection, electroporation, insertion of a plasmid encased in liposomes, or virus 
vectors can be used Eukaryotic cells can also be cotransformed with DNA sequences 
encoding the Cdsl proteins of the invention, and a second foreign DNA molecule 
encoding a selectable phenotype, such as the herpes simplex thymidine kinase gene. 
Another method is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or 
bovine papilloma virus, to transiently infect or transform eukaryotic cells and express 
the protein. (Gluzman, Eukaryotic Viral Vectors (Cold Spring Harbor Laboratory 
Press, 1982)). 

[0080] The invention provides a method for producing a polypeptide encoded 

by the nucleotide sequence of SEQ ID NO: 1 or fragments thereof, including culturing 
the host cell under conditions suitable for the expression of the polypeptide and 
recovering the polypeptide from the host cell culture. 
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[0081] A Cdsl polypeptide or a fragment thereof, can be encoded by a 

recombinant or non-recombinant nucleic acid molecule and expressed in a cell. 
Preparation of an Cdsl polypeptide by recombinant methods provides several 
advantages. In particular, the nucleic acid sequence encoding the Cdsl polypeptide can 
include additional nucleotide sequences encoding, for example, peptides useful for 
recovering the Cdsl polypeptide from the host cell. A Cdsl polypeptide can be 
recovered using well known methods, including, for example, precipitation, gel 
filtration, ion exchange, reverse-phase, or affinity chromatography (see, for example, 
Deutscher et ai, Guide to Protein Purification in Meth. EnzymoL Vol. 182, (Academic 
Press, 1990)). Such methods also can be used to purify a fragment of an polypeptide, for 
example, a particular binding sequence, from a cell in which it is naturally expressed. 

[0082] A recombinant nucleic acid molecule encoding an Cdsl polypeptide or 

a fragment thereof can include, for example, a protease site, which can facilitate 
cleavage of the Cdsl polypeptide from a non-Cdsl polypeptide sequence, for 
example, a tag peptide, secretory peptide, or the like. As such, the recombinant 
nucleic acid molecule also can encode a tag peptide such as a polyhistidine sequence, 
a FLAG peptide (Hopp et aL Biotechnology 6: 1204 (1988)), a glutathione S- 
transferase polypeptide or the like, which can be bound by divalent metal ions, a 
specific antibody (U.S. Patent No. 5,01 1,912), or glutathione, respectively, thus 
facilitating recovery and purification of the Cdsl polypeptide comprising the peptide 
tag. Such tag peptides also can facilitate identification of the Cdsl polypeptide 
through stages of synthesis, chemical or enzymatic modification, linkage, or the like. 
Methods for purifying polypeptides comprising such tags are well known in the art 
and the reagents for performing such methods are commercially available. 

J0083] A nucleic acid molecule encoding a Cdsl polypeptide can be 

engineered to contain one or more restriction endonuclease recognition and cleavage 
sites, which can facilitate, for example, substitution of an element of the Cdsl 
polypeptide such as the selective recognition domain or, where present, a spacer 
element. As such, related Cdsl polypeptides can be prepared, each having a similar 
activity, but having specificity for different fiinction-forming contexts. A restriction 
endonuclease site also can be engineered into (or out of) the sequence coding a 
peptide portion of the Xcdssl polypeptide, and can, but need not change one or more 
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amino acids encoded by the particular sequence. Such a site can provide a simple 
means to identify the nucleic acid sequence, based on cleavage (or lack of cleavage) 
following contact with the relevant restriction endonuclease, and, where introduction 
of the site changes an amino acid, can further provide advantages based on the 
substitution. 

[0084] In another series of embodiments, the present invention provides 

transgenic animal models diseases or disorders associated with mutations in the Cdsl 
protein genes. The animal may be essentially any reptile, fish, mammal, and the like. 
Preferably, the transgenic animal is mammalian including rats, mice, hamsters, guinea 
pigs, rabbits, dogs, cats, goats, sheep, pigs, and non-human primates. In addition, 
invertebrate models, including nematodes and insects, may be used for certain 
applications. The animal models are produced by standard transgenic methods 
including microinjection, transfection, or by other forms of transformation of 
embryonic stem cells, zygotes, gametes, and germ line cells with vectors including 
genomic or cDNA fragments, minigenes, homologous recombination vectors, viral 
insertion vectors and the like. Suitable vectors include vaccinia virus, adenovirus, 
adeno associated virus, retrovirus, liposome transport, neurotropic viruses, herpes 
simplex virus, and the like. The animal models may include transgenic sequences 
comprising or derived from Cdsl proteins including normal and mutant sequences, 
intronic, exonic and untranslated sequences, and sequences encoding subsets of Cdsl 
proteins such as functional domains. 

[0085] The major types of animal models provided include: (I) Animals in 

which a normal human Cdsl gene has been recombinantly introduced into the genome 
of the animal as an additional gene, under the regulation of either an exogenous or an 
endogenous promoter element, and as either a minigene or a large genomic fragment; 
in which a normal human Cdsl gene has been recombinantly substituted for one or 
both copies of the animal's homologous Cdsl gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
Cdsl genes have been recombinantly "humanized" by the partial substitution of 
sequences encoding the human homologue by homologous recombination or gene 
targeting. (2) Animals in which a mutant Cdsl gene has been recombinantly 
introduced into the genome of the animal as an additional gene, under the regulation 
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of either an exogenous or an endogenous promoter element, and as either a minigene 
or a large genomic fragment; in which a mutant Cdsl gene has been recombinantly 
substituted for one or both copies of the animal's homologous Cdsl gene by 
homologous recombination or gene targeting; and/or in which one or both copies of 
one of the animal's homologous Cdsl genes have been recombinantly "humanized" by 
the partial substitution of sequences encoding a mutant human homologue by 
homologous recombination or gene targeting. (3) Animals in which a mutant version 
of one of that animal's Cdsl genes has been recombinantly introduced into the 
genome of the animal as an additional gene, under the regulation of either an 
exogenous or an endogenous promoter element, and as either a minigene or a large 
genomic fragment; and/or in which a mutant version of one of that animal's Cdsl 
genes has been recombinantly substituted for one or both copies of the animal's 
homologous Cdsl gene by homologous recombination or gene targeting. (4) "Knock- 
out" animals in which one or both copies of one of the animal's Cdsl genes have been 
partially or completely deleted by homologous recombination or gene targeting, or 
have been inactivated by the insertion or substitution by homologous recombination 
or gene targeting of exogenous sequences. 

[0086] In a preferred embodiment of the invention, there is provided a 

transgenic non-human animal having a transgene that expresses a Cdsl -encoding 
polynucleotide chromosomally integrated into the germ cells of the animal. Animals 
are referred to as "transgenic" when such animal has had a heterologous DNA 
sequence, or one or more additional DNA sequences normally endogenous to the 
animal (collectively referred to herein as "transgenes") chromosomally integrated into 
the germ cells of the animal. The transgenic animal (including its progeny) will also 
have the transgene fortuitously integrated into the chromosomes of somatic cells. 

[0087] Various methods to make the transgenic animals of the subject 

invention can be employed. Generally speaking, three such methods may be 
employed. In one such method, an embryo at the pronuclear stage (a "one cell 
embryo") is harvested from a female and the transgene is microinjected into the 
embryo, in which case the transgene will be chromosomally integrated into both the 
germ cells and somatic cells of the resulting mature animal. In another such method, 
embryonic stem cells are isolated and the transgene incorporated therein by 
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electroporation, plasmid transfection or microinjection, followed by reintroduction of 
the stem cells into the embryo where they colonize and contribute to the germ line. 
Methods for microinjection of mammalian species is described in United States Patent 
No. 4,873,191. In yet another such method, embryonic cells are infected with a 
retrovirus containing the transgene whereby the germ cells of the embryo have the 
transgene chromosomally integrated therein. When the animals to be made transgenic 
are avian, because avian fertilized ova generally go through cell division for the first 
twenty hours in the oviduct, microinjection into the pronucleus of the fertilized egg is 
problematic due to the inaccessibility of the pronucleus. Therefore, of the methods to 
make transgenic animals described generally above, retrovirus infection is preferred 
for avian species, for example as described in U.S. Patent No. 5,162,215. If 
microinjection is to be used with avian species, however, a recently published 
procedure by Love et aL y (Biotechnology, 12, Jan 1994) can be utilized whereby the 
embryo is obtained from a sacrificed hen approximately two and one-half h after the 
laying of the previous laid egg, the transgene is microinjected into the cytoplasm of 
the germinal disc and the embryo is cultured in a host shell until maturity. When the 
animals to be made transgenic are bovine or porcine, microinjection can be hampered 
by the opacity of the ova thereby making the nuclei difficult to identify by traditional 
differential interference-contrast microscopy. To overcome this problem, the ova can 
first be centrifuged to segregate the pronuclei for better visualization. 

[0088] The non-human animals of the invention are murine typically {e.g., 

mouse). The transgenic non-human animals of the invention are produced by 
introducing "transgenes" into the germline of the non-human animal. Embryonal 
target cells at various developmental stages can be used to introduce transgenes. 
Different methods are used depending on the stage of development of the embryonal 
target cell. The zygote is the best target for microinjection. The use of zygotes as a 
target for gene transfer has a major advantage in that in most cases the injected DNA 
will be incorporated into the host gene before the first cleavage (Brinster et al, Proa 
Natl Acad ScL USA 82:4438-4442, 1985). As a consequence, all cells of the 
transgenic non-human animal will carry the incorporated transgene. This will in 
general also be reflected in the efficient transmission of the transgene to offspring of 
the founder since 50% of the germ cells will harbor the transgene. 
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[0089] The term "transgenic" is used to describe an animal which includes 

exogenous genetic material within all of its cells. A "transgenic" animal can be 
produced by cross-breeding two chimeric animals which include exogenous genetic 
material within cells used in reproduction. Twenty-five percent of the resulting 
offspring will be transgenic i.e. y animals which include the exogenous genetic 
material within all of their cells in both alleles. 50% of the resulting animals will 
include the exogenous genetic material within one allele and 25% will include no 
exogenous genetic material. 

[0090] In the microinjection method useful in the practice of the subject 

invention, the transgene is digested and purified free from any vector DNA e.g. by gel 
electrophoresis. It is preferred that the transgene include an operatively associated 
promoter which interacts with cellular proteins involved in transcription, ultimately 
resulting in constitutive expression. Promoters useful in this regard include those 
from cytomegalovirus (CMV), Moloney leukemia virus (MLV), and herpes virus, as 
well as those from the genes encoding metallothionine, skeletal actin, P-enolpyruvate 
carboxylase (PEPCK), phosphoglycerate (PGK), DHFR, and thymidine kinase. 
Promoters for viral long terminal repeats (LTRs) such as Rous Sarcoma Virus can also 
be employed. Constructs useful in plasmid transfection of embryonic stem cells will 
employ additional regulatory elements well known in the art such as enhancer 
elements to stimulate transcription, splice acceptors, termination and polyadenylation 
signals, and ribosome binding sites to permit translation. 

[0091] Retroviral infection can also be used to introduce transgene into a 

non-human animal, as described above. The developing non-human embryo can be 
cultured in vitro to the blastocyst stage. During this time, the blastomeres can be 
targets for retro viral infection (Jaenich, (1 976) Proa Natl Acad. Sci USA 
73 : 1 260- 1 264). Efficient infection of the blastomeres is obtained by enzymatic 
treatment to remove the zona pellucida (Hogan, et al. (1986) in Manipulating the 
Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y). 
The viral vector system used to introduce the transgene is typically a 
replication-defective retro virus carrying the transgene (Jahner, et al t (1985) Proc. 
Natl Acad Sci. USA 82:6927-693 1 ; Van der Putten, et al } (1985) Proc. Natl. Acad 
Sci USA 82:6148-6152, ). Transfection is easily and efficiently obtained by culturing 
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the blastomeres on a monolayer of virus-producing cells (Van der Putten, supra', 
Stewart, etal, (1987) EMBO J. 6:383-388) . Alternatively, infection can be 
performed at a later stage. Virus or virus-producing cells can be injected into the 
blastocoel (Jahner et al (1982) Nature 298:623-628). Most of the founders will be 
mosaic for the transgene since incorporation occurs only in a subset of the cells which 
formed the transgenic nonhuman animal. Further, the founder may contain various 
retro viral insertions of the transgene at different positions in the genome which 
generally will segregate in the offspring. In addition, it is also possible to introduce 
transgenes into the germ line, albeit with low efficiency, by intrauterine retroviral 
infection of the midgestation embryo (Jahner et al, supra). 

[0092] A third type of target cell for transgene introduction is the embryonal 

stem cell (ES). ES cells are obtained from pre-implantation embryos cultured in vitro 
and fused with embryos (M J. Evans et al Nature 292: 154-1 56, 1981 ; M.O. Bradley 
etal, Nature 309: 255-258, 1984; Gossler, et al, Proa Natl Acad Sci USA 83: 
9065-9069, 1986; and Robertson etal f Nature 322:445-448, 1986). Transgenes can 
be efficiently introduced into the ES cells by DNA transfection or by retro 
virus-mediated transduction. Such transformed ES cells can thereafter be combined 
with blastocysts from a nonhuman animal. The ES cells thereafter colonize the 
embryo and contribute to the germ line of the resulting chimeric animal. (For review 
see Jaenisch, R., Science 240: 1468-1474, 1988). 

[0093] 'Transformed" means a cell into which (or into an ancestor of which) 

has been introduced, by means of recombinant nucleic acid techniques, a heterologous 
nucleic acid molecule. "Heterologous" refers to a nucleic acid sequence that either 
originates from another species or is modified from either its original form or the form 
primarily expressed in the cell. 

[0094] "Transgene" means any piece of DNA which is inserted by artifice into 

a cell, and becomes part of the genome of the organism (i.e., either stably integrated 
or as a stable extrachromosomal element) which develops from that cell. Such a 
transgene may include a gene which is partly or entirely heterologous (i.e., foreign) to 
the transgenic organism, or may represent a gene homologous to an endogenous gene 
of the organism. Included within this definition is a transgene created by the 
providing of an RNA sequence which is transcribed into DNA and then incorporated 
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into the genome. The transgenes of the invention include DNA sequences which 
encode Cdsl polypeptide-sense and antisense polynucleotides, which may be 
expressed in a transgenic non-human animal. The term "transgenic" as used herein 
additionally includes any organism whose genome has been altered by in vitro 
manipulation of the early embryo or fertilized egg or by any transgenic technology to 
induce a specific gene knockout. As used herein, the term "transgenic" includes any 
transgenic technology familiar to those in the art which can produce an organism 
carrying an introduced transgene or one in which an endogenous gene has been 
rendered non-functional or "knocked out". 

[0095] Another embodiment of the invention provides a computer readable 

medium having store thereon a nucleic acid sequence selected from the group 
consisting of SEQ ED NO:l and sequences substantially identical thereto, or a 
polypeptide sequence selected from the group consisting of SEQ ID NO:2 and 
sequences substantially identical thereto. 

[0096] A further embodiment of the invention provides a computer system 

comprising a processor and a data storage device wherein said date storage device has 
stored thereon a nucleic acid sequence selected from the group consisting of SEQ ID 
NO:l, and sequences substantially identical thereto, or a polypeptide sequence 
selected from the group consisting of SEQ ID NO:2, and sequences substantially 
identical thereto. The computer system, additionally can contain a sequence 
comparison algorithm and a data storage device having at least one reference 
sequence stored on it. The sequence comparison algorithm comprises a computer 
program which indicates polymorphisms. The term "polymorphism", as used herein, 
refers to the existence of multiple alleles at a single locus. Polymorphism can be are 
several types including, for example, those that change DNA sequence but do not 
change protein sequence, those that change protein sequence without changing 
function, those that create proteins with a different activity, and those that create 
proteins that are non-functional. 

[0097] Embodiments of the invention include systems (e.g., internet based 

systems), particularly computer systems which store and manipulate the coordinate 
and sequence information described herein. One example of a computer system 1 00 
is illustrated in block diagram form in Figure 3. As used herein, "a computer system" 
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refers to the hardware components, software components, and data storage 
components used to analyze coordinates and sequences information. The computer 
system 100 typically includes a processor for processing, accessing and manipulating 
the sequence data. The processor 105 can be any well-known type of central 
processing unit, such as, for example, the Pentium III from Intel Corporation, or 
similar processor from Sun, Motorola, Compaq, AMD or International Business 
Machines. 

[0098] Typically the computer system 100 is a general purpose system that 

comprises the processor 105 and one or more internal data storage components 1 10 
for storing data, and one or more data retrieving devices for retrieving the data stored 
on the data storage components. A skilled artisan can readily appreciate that any one 
of the currently available computer systems are suitable. 

[0099] In one particular embodiment, the computer system 100 includes a 

processor 105 connected to a bus which is connected to a main memory 115 
(preferably implemented as RAM) and one or more internal data storage devices 1 10, 
such as a hard drive and/or other computer readable media having data recorded 
thereon. In some embodiments, the computer system 100 further includes one or 
more data retrieving device 1 1 8 for reading the data stored on the internal data storage 
devices 1 10. 

[0100J The data retrieving device 118 may represent, for example, a floppy 

disk drive, a compact disk drive, a magnetic tape drive, or a modem capable of 
connection to a remote data storage system (e.g., via the internet) etc. In some 
embodiments, the internal data storage device 1 10 is a removable computer readable 
medium such as a floppy disk, a compact disk, a magnetic tape, etc. containing 
control logic and/or data recorded thereon. The computer system 100 may 
advantageously include or be programmed by appropriate software for reading the 
control logic and/or the data from the data storage component once inserted in the 
data retrieving device. 

[0101] The computer system 100 includes a display 120 which is used to 

display output to a computer user. It should also be noted that the computer system 
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100 can be linked to other computer systems 125a-c in a network or wide area 
network to provide centralized access to the computer system 100. 

[0102] Figure 4 is a flow diagram illustrating one embodiment of a process 

200 for comparing a new nucleotide or protein sequence with a database of sequences 
in order to determine the homology levels between the new sequence and the 
sequences in the database. The database of sequences can be a private database stored 
within the computer system 100, or a public database such as GENBANK that is 
available through the Internet. 

[0103] The process 200 begins at a start state 201 and then moves to a state 

202 wherein the new sequence to be compared is stored to a memory in a computer 
system 100. As discussed above, the memory could be any type of memory, 
including RAM or an internal storage device. 

[0104] The process 200 then moves to a state 204 wherein a database of 

sequences is opened for analysis and comparison. The process 200 then moves to a 
state 206 wherein the first sequence stored in the database is read into a memory on 
the computer. A comparison is then performed at a state 210 to determine if the first 
sequence is the same as the second sequence. It is important to note that this step is 
not limited to performing an exact comparison between the new sequence and the first 
sequence in the database. Well-known methods are known to those of skill in the art 
for comparing two nucleotide or protein sequences, even if they are not identical. For 
example, gaps can be introduced into one sequence in order to raise the homology 
level between the two tested sequences. The parameters that control whether gaps or 
other features are introduced into a sequence during comparison are normally entered 
by the user of the computer system. 

[0105] Once a comparison of the two sequences has been performed at the 

state 210, a determination is made at a decision state 210 whether the two sequences 
are the same. Of course, the term "same" is not limited to sequences that are 
absolutely identical. Sequences that are within the homology parameters entered by 
the user will be marked as "same" in the process 200. 

[0106] If a determination is made that the two sequences are the same, the 

process 200 moves to a state 214 wherein the name of the sequence from the database 
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is displayed to the user. This state notifies the user that the sequence with the 
displayed name fulfills the homology constraints that were entered. Once the name of 
the stored sequence is displayed to the user, the process 200 moves to a decision state 
218 wherein a determination is made whether more sequences exist in the database. 
If no more sequences exist in the database, then the process 200 terminates at an end 
state 220. However, if more sequences do exist in the database, then the process 200 
moves to a state 224 wherein a pointer is moved to the next sequence in the database 
so that it can be compared to the new sequence. In this manner, the new sequence is 
aligned and compared with every sequence in the database. 

[0107] It should be noted that if a determination had been made at the decision 

state 212 that the sequences were not homologous, then the process 200 would move 
immediately to the decision state 218 in order to determine if any other sequences 
were available in the database for comparison. 

[0108] Figure 5 is a flow diagram illustrating one embodiment of a process 

250 in a computer for determining whether two sequences are homologous. The 
process 250 begins at a start state 252 and then moves to a state 254 wherein a first 
sequence to be compared is stored to a memory. The second sequence to be 
compared is then stored to a memory at a state 256. The process 250 then moves to a 
state 260 wherein the first character in the first sequence is read and then to a state 
262 wherein the first character of the second sequence is read. It should be 
understood that if the sequence is a nucleotide sequence, then the character would 
normally be either A, T, C, G or U. If the sequence is a protein sequence, then it is 
preferably in the single letter amino acid code so that the first and sequence sequences 
can be easily compared. 

[0109] A determination is then made at a decision state 264 whether the two 

characters are the same. If they are the same, then the process 250 moves to a state 
268 wherein the next characters in the first and second sequences are read. A 
determination is then made whether the next characters are the same. If they are, then 
the process 250 continues this loop until two characters are not the same. If a 
determination is made that the next two characters are not the same, the process 250 
moves to a decision state 274 to determine whether there are any more characters 
either sequence to read. 
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[0110] If there are not any more characters to read, then the process 250 

moves to a state 276 wherein the level of homology between the first and second 
sequences is displayed to the user. The level of homology is determined by 
calculating the proportion of characters between the sequences that were the same out 
of the total number of sequences in the first sequence. Thus, if every character in a 
first 100 nucleotide sequence aligned with a every character in a second sequence, the 
homology level would be 100%. 

[01 1 1 J Homology or identity is often measured using sequence analysis 

software (e.g.. Sequence Analysis Software Package of the Genetics Computer Group, 
University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705). Such software matches similar sequences by assigning degrees of 
homology to various deletions, substitutions and other modifications. The terms 
"homology" and "identity" in the context of two or more nucleic acids or polypeptide 
sequences, refer to two or more sequences or subsequences that are the same or have a 
specified percentage of amino acid residues or nucleotides that are the same when 
compared and aligned for maximum correspondence over a comparison window or 
designated region as measured using any number of sequence comparison algorithms 
or by manual alignment and visual inspection. 

[01121 F° r sequence comparison, typically one sequence acts as a reference 

sequence, to which test sequences are compared. When using a sequence comparison 
algorithm, test and reference sequences are entered into a computer, subsequence 
coordinates are designated, if necessary, and sequence algorithm program parameters 
are designated. Default program parameters can be used, or alternative parameters 
can be designated. The sequence comparison algorithm then calculates the percent 
sequence identities for the test sequences relative to the reference sequence, based on 
the program parameters. 

[0113] A "comparison window", as used herein, includes reference to a 

segment of any one of the number of contiguous positions selected from the group 
consisting of from 20 to 600, usually about 50 to about 200, more usually about 100 
to about 1 50 in which a sequence may be compared to a reference sequence of the 
same number of contiguous positions after the two sequences are optimally aligned. 
Methods of alignment of sequence for comparison are well-known in the art. Optimal 
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alignment of sequences for comparison can be conducted, e.g., by the local homology 
algorithm of Smith & Waterman, Adv. Appl. Math. 2:482, 1981, by the homology 
alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443, 1970, by the 
search for similarity method of person & Lipman, Proc. Natl. Acad. Sci. USA 
85:2444, 1988, by computerized implementations of these algorithms (GAP, 
BESTFIT, FAST A, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment 
and visual inspection. Other algorithms for determining homology or identity include, 
for example, in addition to a BLAST program (Basic Local Alignment Search Tool at 
the National Center for Biological Information), ALIGN, AMAS (Analysis of 
Multiply Aligned Sequences), AMPS (Protein Multiple Sequence Alignment), 
ASSET (Aligned Segment Statistical Evaluation Tool), BANDS, BESTSCOR, 
BIOSCAN (Biological Sequence Comparative Analysis Node), BLIMPS (BLocks 
IMProved Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V, CLUSTAL 
W, CONSENSUS, LCONSENSUS, WCONSENSUS, Smith- Waterman algorithm, 
DARWIN, Las Vegas algorithm, FNAT (Forced Nucleotide Alignment Tool), 
Framealign, Framesearch, DYNAMIC, FILTER, FSAP (Fristensky Sequence 
Analysis Package), GAP (Global Alignment Program), GENAL, GIBBS, GenQuest, 
ISSC (Sensitive Sequence Comparison), LALIGN (Local Sequence Alignment), LCP 
(Local Content Program), MACAW (Multiple Alignment Construction & Analysis 
Workbench), MAP (Multiple Alignment Program), MBLKP, MBLKN, PIMA 
(Pattern-Induced Multi-sequence Alignment), SAGA (Sequence Alignment by 
Genetic Algorithm) and WHAT-IF. Such alignment programs can also be used to 
screen genome databases to identify polynucleotide sequences having substantially 
identical sequences. A number of genome databases are available, for example, a 
substantial portion of the human genome is available as part of the Human Genome 
Sequencing Project (see J. Roach, at the uniform resource locator (url) 
weber.u. Washington.edu/~roach/hurnan_ genome_ progress 2.html) (Gibbs, 1995). 
At least twenty-one other genomes have already been sequenced, including, for 
example, M genitalium (Fraser et al y 1995), M. jannaschii (Bult et al y 1 996), H. 
influenzae (Fleischmann et al y 1 995), K coli (Blattner et al y 1 997), and yeast (S. 
cerevisiae) (Mewes et al. y 1997), and D. melanogaster (Adams et al y 2000). 
Significant progress has also been made in sequencing the genomes of model 
organism, such as mouse, C. elegans, and Arabadopsis sp. Several databases 
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containing genomic information annotated with some functional information are 
maintained by different organization, and are accessible via the internet at world wide 
web sites such as: tigr.org/tdb, .genetics.wisc.edu; genome-.stanford.edu/~ball; hiv- 
web.lanl.gov; ncbi.nlm.nih.gov; ebi.ac.uk; Pasteur. fr/other/biology; and 
genone. wi . mit . edu . 

[0114] One example of a useful algorithm is BLAST and BLAST 2.0 

algorithms, which are described in Altschul etal, Nuc. Acids Res. 25:3389-3402, 
1977, and Altschul etal y J. Mol Biol. 215:403-410, 1990, respectively. Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (NCBI; world wide web address ncbi.nlm.nih.gov). This 
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 
short words of length W in the query sequence, which either match or satisfy some 
positive-valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et al. y supra). These initial neighborhood word hits act as seeds for initiating 
searches to find longer HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be 
increased. Cumulative scores are calculated using, for nucleotide sequences, the 
parameters M (reward score for a pair of matching residues:, always >0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension 
of the word hits in each direction are halted when: the cumulative alignment score 
falls off by the quantity X from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or more negative-scoring 
residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. The 
BLASTN program (for nucleotide sequences) uses as defaults a word length (W) of 
1 1, an expectation (E) of 10, M=5, N=-4 and a comparison of both strands. For amino 
acid sequences, the BLASTP program uses as defaults a word length of 3, and 
expectations (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, 
Proc. Natl. Acad. Sci. USA 89:10915, 1989) alignments (B) of 50, expectation (E) of 
10, M^5, N= -4, and a comparison of both strands. 
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[0115] The BLAST algorithm also performs a statistical analysis of the 

similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Natl. Acad. Sci. 
USA 90:5873, 1993). One measure of similarity provided by BLAST algorithm is the 
smallest sum probability (P(N)), which provides an indication of the probability by 
which a match between two nucleotide or amino acid sequences would occur by 
chance. For example, a nucleic acid is considered similar to a references sequence if 
the smallest sum probability in a comparison of the test nucleic acid to the reference 
nucleic acid is less than about 0.2, more preferably less than about 0.01, and most 
preferably less than about 0.001 . 

[0116] In one embodiment, protein and nucleic acid sequence homologies are 

evaluated using the Basic Local Alignment Search Tool ("BLAST") In particular, five 
specific BLAST programs are used to perform the following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence 
against a protein sequence database; 

(2) BLASTN compares a nucleotide query sequence against a 
nucleotide sequence database; 

(3) BLASTX compares the six-frame conceptual translation 
products of a query nucleotide sequence (both strands) against a protein sequence 
database; 

(4) TBLASTN compares a query protein sequence against a 
nucleotide sequence database translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide 
query sequence against the six-frame translations of a nucleotide sequence database. 

[0117] The BLAST programs identify homologous sequences by identifying 

similar segments, which are referred to herein as "high-scoring segment pairs," 
between a query amino or nucleic acid sequence and a test sequence which is 
preferably obtained from a protein or nucleic acid sequence database. High-scoring 
segment pairs are preferably identified (i.e., aligned) by means of a scoring matrix, 
many of which are known in the art. Preferably, the scoring matrix used is the 
BLOSUM62 matrix (Gonnet et al y Science 256: 1443-1445, 1992; Henikoff and 
Henikoff, Proteins 17:49-61, 1993). Less preferably, the PAM or PAM250 matrices 
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may also be used (see, e.g., Schwartz and Dayhoff, eds., 1978, Matrices for Detecting 
Distance Relationships: Atlas of Protein Sequence and Structure, Washington: 
National Biomedical Research Foundation). BLAST programs are accessible through 
the U.S. National Library of Medicine site on the world wide web, for example 

[0118] The parameters used with the above algorithms may be adapted 

depending on the sequence length and degree of homology studied. In some 
embodiments, the parameters may be the default parameters used by the algorithms in 
the absence of instructions from the user. 

[0119] A method is provided for identifying a reagent that phosphorylation of 

a Cdsl polypeptide. The method includes incubating the reagent with a Cdsl 
polypeptide, and one or more oligonucleotides that can form double-stranded DNA 
under conditions sufficient to permit the components to interact with each other, and 
comparing the phosphorylation status of the Cdsl polypeptide incubated with the 
reagent with the phosphorylation status of a Cdsl polypeptide not incubated with the 
reagent. 

[0120] Modulation of phosphorylation may be an increase in phosphorylation 

or a decrease in phosphorylation. Various amino acid residues can be phosphofylated. 
Phosphorylation is the addition of a phosphate group, e.g.PDt, by esterification with 
the hydroxyl group of an amino acid. Amino acids serine and threonine are typically 
phosphorylated with serine preferred. 

[0121] The reagent can be selected from the species consisting of a peptide, a 

peptidomimetic, a polypeptide, a pharmaceutical, a chemical compound, an 
oligonucleotide, a polynucleotide and an antibody. 

[01 22] When the Cdsl polypeptide is Xenopus Cdsl, a reagent includes one 

or more oligonucleotides. In one embodiment of the invention oligonucleotides 
having the sequences set forth in SEQ ID NO 3 and SEQ ID NO:4 modulate 
phosphorylation. In another embodiment of the invention, a nucleotide having the 
sequence set forth in SEQ ID NO: 5 modulates phosphorylation. Any oligonucleotides 
that can form a double-stranded DNA molecule can modulate phosphorylation of 
Cdsl. 
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[0123] Also provided by the invention is a method of modulating cell cycle 

progression in a cell. The method includes providing to the cell a reagent that affects 
the activity or expression of a Cdsl polypeptide. Modulation of cell cycle progression 
can be a speeding up of the cell cycle or it may be an inhibition or reduction in the 
progression of the cell cycle. An inhibition of the cell cycle can include an increase in 
the time a cell is in a gap phase, e.g., GO, the pause after mitosis, Gl, the gap after 
mitosis before DNA synthesis starts or G2, the gap after DNA synthesis is complete 
and before mitosis and cell division begins. The gap phases allow for checkpoints, 
stopping points in the cell cycle where progress can be halted. During this time, 
cellular mechanisms insure that DNA is not damaged, or incompletely or incorrectly 
replicated. Speeding up the cell cycle could allow rounds of mitosis to proceed in a 
shorter period of time. This may result in damaged DNA which, following mitosis, 
could have serious or fatal effects for the daughter cells. 

[0124] Any Cdsl protein may be employed in invention methods. In certain 

embodiments a Xenopus Cdsl protein according to the amino acid sequence set forth 
in SEQ ID NO:2 is used. ^ 

[01 25] The cell may be any cell of interest, including but not limited to 

neuronal cells, glial cells, cardiac cells, bronchial cells, uterine cells, testicular cells, 
liver cells, renal cells, intestinal cells, cells from the thymus and spleen, placental 
cells, endothelial cells, endocrine cells including thyroid, parathyroid, pituitary and 
the like, smooth muscle cells and skeletal muscle cells. The term "incubating" 
includes conditions which allow contact between the test compound and the cell of 
interest. "Contacting" may include in solution or in solid phase. 

[0126] Reagents which modulate cell cycle progression can include peptides, 

peptidomimetics, polypeptides, pharmaceuticals, chemical compounds and biological 
agents, for example. Antibodies and combinatorial compound libraries can also be 
tested using the method of the invention. One class of organic molecules, preferably 
small organic compounds having a molecular weight of more than 50 and less than 
about 2,500 Daltons. Candidate agents comprise functional groups necessary for 
structural interaction with proteins, particularly hydrogen bonding, and typically 
include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least 
two of the functional chemical groups. The candidate agents often comprise cyclical 
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carbon or heterocyclic structures and/or aromatic or polyaromatic structures 
substituted with one or more of the above functional groups. 

[0127J The test agent may also be a combinatorial library for screening a 

plurality of compounds. Compounds such as peptides identified in the method of the 
invention can be further cloned, sequenced, and the like, either in solution of after 
binding to a solid support, by any method usually applied to the isolation of a specific 
DNA sequence Molecular techniques for DNA analysis (Landegren et al. y Science 
242:229-237, 1988) and cloning have been reviewed (Sambrook et al f Molecular 
Cloning: a Laboratory Manual 2nd Ed.; Cold Spring Harbor Laboratory Press, 
Plainview, NY, 1998, herein incorporated by reference). 

[0128] Candidate compounds are obtained from a wide variety of sources 

including libraries of synthetic or natural compounds. For example, numerous means 
are available for random and directed synthesis of a wide variety of organic 
compounds and biomolecules, including expression of randomized oligonucleotides 
and oligopeptides. Alternatively, libraries of natural compounds in the form of 
bacterial, fungal, plant and animal extracts are available or readily produced. 
Additionally, natural or synthetically produced libraries and compounds are readily 
modified through conventional chemical, physical and biochemical means, and may 
be used to produce combinatorial libraries. Known pharmacological agents may be 
subjected to directed or random chemical modifications, such as acylation, alkylation, 
esterification, amidification, etc., to produce structural analogs. Candidate agents are 
also found among biomolecules including, but not limited to: peptides, saccharides, 
fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs or 
combinations thereof. 

[0129] A variety of other agents may be included in the 

screening/identification assay. These include agents like salts, neutral proteins, e.g., 
albumin, detergents, etc. that are used to facilitate optimal protein-protein binding 
and/or reduce nonspecific or background interactions. Reagents that improve the 
efficiency of the assay, such as protease inhibitors, nuclease inhibitors, antimicrobial 
agents and the like may be used. The mixture of components are added in any order 
that provides for the requisite binding. Incubations are performed at any suitable 
temperature, typically between 4 and 40°C. Incubation periods are selected for 
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optimum activity, but may also be optimized to facilitate rapid high-throughput 
screening. Typically between 0.1 and 10 h will be sufficient. 

[0130] The term "modulating the cell cycle progression' 7 refers to altering the 

cell cycle by inhibiting its progress or by stimulating its progress. For example, 
inhibition of the cell cycle can be accomplished by inducing a gap phase that would 
not have occurred in the absence of modulation or increasing the duration of a gap 
phase relative to the duration in the absence of modulation. Stimulation of the cell 
cycle can be accomplished by eliminating a gap phase that would have occurred in the 
absence of modulation or decreasing the duration of a gap phase relative to the 
duration in the absence of modulation. 

|0131) The term "reagent" as used herein describes any molecule, e.g., 

protein, nucleic acid, or pharmaceutical, with the capability of altering the expression 
of a Cdsl polynucleotide or activity of Cdsl polypeptide. Candidate reagents include 
nucleic acids encoding a Cdsl polypeptide, or that interfere with expression of a Cdsl 
polypeptide, such as an antisense nucleic acid, ribozymes, and the like Candidate 
reagents further include antibodies that specifically recognize Cdsl polypeptides. 
Candidate reagents also encompass numerous chemical classes wherein the agent 
modulates Cdsl expression or activity. One exemplary reagent is a double stranded 
DNA molecule. 

J01 32) In yet another embodiment of the invention, there is provided a method 

of treating a disorder associated with cell cycle progression. The -method includes 
administering to a subject in need thereof a therapeutically effective amount of a 
compound that modulates a Cdsl polypeptide activity. 

101 33] In another embodiment of the invention, there is provided a method of 

treating a subject having a disorder associated with increased cell cycle progression 
compared to a subject not having the disorder. The method includes administering to 
a subject having the disorder a therapeutically effective amount of a compound that 
increases a Cdsl polypeptide activity. 

J0134] Treatment of a disorder associated with cell cycle progression may 

influence phosphorylation of Cdsl, activation of Cdsl, phosphorylation of Cdc25, cell 
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number, rounds of mitosis, DNA repair, functional properties of Cdsl proteins, and 
the like. 

[0135] Modulation of a Cdsl polypeptide activity envisions the suppression 

of Cdsl activity or expression when Cdsl is overexpressed or has an increased 
activity as compared to a control. The term "modulate" also includes the 
augmentation of the expression of Cdsl polypeptide when it is underexpressed or has 
a decreased activity as compared to a control. 

[0136] The disorder can be a cell proliferative disorder. A cell proliferative 

disorder as described herein may be a neoplasm. Such neoplasms are either benign or 
malignant. The term "neoplasm" refers to a new, abnormal growth of cells or a 
growth of abnormal cells that reproduce faster than normal. A neoplasm creates an 
unstructured mass (a tumor) which can be either benign or malignant. The term 
"benign" refers to a tumor that is non-cancerous, e.g. its cells do not invade 
surrounding tissues or metastasize to distant sites. The term "malignant" refers to a 
tumor that is metastastic, invades contiguous tissue or no longer under normal cellular 
growth control. Neoplasms are generally derived from cells that normally maintain a 
proliferative capacity; almost every cell and tissue type can give rise to a neoplasm. 
For example, a mutation in the gene encoding CHK2, the human homologue of 
XCdsl, appears to be involved in small cell lung cancer (Haruki et al (2000) Cancer 
Research 60:4689-4692). 

[0137] Where a disorder is associated with the increased expression of a Cdsl 

polypeptide, nucleic acid sequences that interfere with the expression of a Cdsl 
polypeptide can be used (see Kushner and Silverman (2000) Curr. Oncol Reports, 
2:23-30, herein incorporated by reference). In this manner, phosphorylation of a Cdsl 
or Cdc25 protein, cell cycle progression, and the like can be modulated. This 
approach also utilizes, for example, antisense nucleic acid, ribozymes, or triplex 
agents to block transcription or translation of a Cdsl mRNA, either by masking that 
mRNA with an antisense nucleic acid or triplex agent, or by cleaving it with a 
ribozyme in disorders associated with increased Cdsl expression. Alternatively, a 
dominant negative form of a Cdsl polypeptide could be administered. 
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[0138] When Cdsl is overexpressed, candidate agents include antisense 

nucleic acid sequences. Antisense nucleic acids are DNA or RNA molecules that are 
complementary to at least a portion of a specific mRNA molecule (Weintraub (1990) 
Scientific American^ 262:40). In the cell, the antisense nucleic acids hybridize to the 
corresponding mRNA, forming a double-stranded molecule. The antisense nucleic 
acids interfere with the translation of the mRN A, since the cell will not translate a 
mRNA that is double-stranded. Antisense oligomers of about 15 nucleotides are 
preferred, since they are easily synthesized and are less likely to cause problems than 
larger molecules when introduced into the target cell. The use of antisense methods 
to inhibit the in vitro translation of genes is well known in the art (Marcus-Sakura, 
(\9%%)AnalBiochem.. 172:289). 

10139] Use of an oligonucleotide to stall transcription is known as the triplex 

strategy since the oligomer winds around double-helical DNA, forming a three-strand 
helix. Therefore, these triplex compounds can be designed to recognize a unique site 
on a chosen gene (Maher et a/. 7 (1991) Antisense Res. andDev^ 1:227, Helene, 
(1 991) Anticancer Drug Design, 6: 569). 

|0140] Ribozymes are RNA molecules possessing the ability to specifically 

cleave other single-stranded RNA in a manner analogous to DNA restriction 
endonucleases. Through the modification of nucleotide sequences which encode 
these RNAs, it is possible to engineer molecules that recognize specific nucleotide 
sequences in an RNA molecule and cleave it (Cech, 1988, J. Amer. Med. Assn. y 
260:3030). A major advantage of this approach is that, because they are 
sequence-specific, only mRNAs with particular sequences are inactivated. 

[0141] There are two basic types of ribozymes namely, tetrahymena-type, 

(HasselhofF (1 988) Nature , 334:585) and "hammerhead'Mype. Tetrahymena-type 
ribozymes recognize sequences which are four bases in length, while 
"hammerhead"-type ribozymes recognize base sequences 11-18 bases in length. The 
longer the recognition sequence, the greater the likelihood that the sequence will 
occur exclusively in the target mRNA species. Consequently, hammerhead-type 
ribozymes are preferable to tetrahymena-type ribozymes for inactivating a specific 
mRNA species and 18-based recognition sequences are preferable to shorter 
recognition sequences. 
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[0142] When a disorder is associated with the decreased expression of a Cdsl 

polypeptide, nucleic acid sequences that encode a Cdsl polypeptide can be used. An 
agent which modulates Cdsl expression includes a polynucleotide encoding a 
polypeptide of SEQ ID N02, SEQ ED NO:4, or a conservative variant thereof 
Alternatively, an agent of use with the subject invention includes reagents that 
increase the expression of a polynucleotide encoding Cdsl or a reagent that increases 
the activity of a Cdsl polypeptide. 

10143] The following examples are intended to illustrate but not limit the 

invention. 

EXAMPLE 1 

Isolation of a cDNA Encoding Xenopus Cdsl Homologue 

[0144] To study the functional properties of Cdsl in Xenopus egg extracts, we 

isolated a cDNA encoding Xenopus Cdsl by using database analysis, polymerase 
chain reaction (PCR) amplification, and library screening. 

[0145] An internal fragment of a cDNA encoding Xcdsl was obtained by 

polymerase chain reaction (PCR) using the degenerate oligonucleotides 
AA(T/C)GGIACIT(T/G)(T/C/G)1T1AA (SEQ ID NO:6) and 
AT1A(G/A)I AT(A/G)TTITCIGG(T/C)TT1AI(A/G)TCTC(T/G)(A/G)TG (SEQ ID 
NO:7), which were designed according to conserved areas found in Cdsl 
homologues. The PCR reactions contained Xenopus oocyte cDNAs as template, 50 
pmol of the degenerate oligonucleotides, 200 pM of dNTPs, and 0.5 units of Taq 
polymerase in the buffer supplied by the manufacturer (GIBCO-BRL). PCR reactions 
were heated at 94°C for 2 minutes followed by 30 cycles of amplification. Each cycle 
consisted of segments of 94°C for I minute, 48°C for 2 minutes, and 72°C for 1 
minute. An extra 10 minutes was added to the 72°C extension step for the last cycle. 
The PCR products were separated on a 1% agarose gel. A 600-base pair DNA 
fragment was isolated from the gel and subsequently ligated to the vector pCR2.1 
using the TA cloning kit (Stratagene). The 600-base pair insert was recovered from 
the vector by digesting with EcoPJ and was then labeled with 32 P. The probe (3xl0 5 
cpm/pl) was denatured in boiling water for 5 minutes before being used to screen a 
Xenopus oocyte cDNA library (Mueller et al. y (1995) Mol Biol Cell, 6:1 19-134). 
Approximately IxlO 6 independent colonies were screened. After secondary 
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screening, a full-length cDN A clone was obtained. Both strands of the cDNA were 
sequenced by primer walking with a dye terminator cycle sequencing kit and an ABI 
model 373 automated sequencer (Perkin-Elmer Corp.). 

[0146] The cDNA sequence contains a large open reading frame, a V 

poly(dA) stretch representing the poly(A) tail of the mRNA, and multiple in-frame 
translation termination codons upstream of the coding region. 

10147] Antibody Production. To clone the coding sequence of Xcdsl into 

the expression vector pET30(a)+ (Novagen), a BamlR site upstream of the initiation 
codon and a Xhol site downstream of the termination codon were introduced by 
performing PCR with the primers 

GGACGTCGGATCCTCTCGTGATACTAAAACAGAG (SEQ ID NO:8) and 
GGACGTCCTCGAGTTATCTTTTTGCTCTCTTTTCGG (SEQ ID NO:9). The 
resulting 1.5 kilobase PCR product was digested with B a ml 11 and Xhol, and ligated to 
pET30(a)+ that had been digested with BamYll and Xhol. The pET30(a)-Xcdsl 
construct was introduced into the£ coli host strain BL21(DE3)pLysS (Novagen). 
Expression of His6-Xcdsl was induced by growing the cells at 37 P C for 3 hours in the 
presence of 0.4 mM IPTG. His6-Xcdsl was purified by nickel agarose 
chromatography, subjected to SDS-PAGE, and subsequently excised from the gel. 
Polyclonal rabbit antibodies against gel purified His6-Xcdsl were produced 
commercially (Covance Research Products Inc.). Antibodies against a peptide 
corresponding to the C-terminal end of Xcdsl (CSEILPTSAEKRAKR; SEQ ID 
NO: 10) were generated at another commercial facility (Zymed Laboratories Inc.). 

[0148] The Xcdsl cDN A encodes a 58 kd translation product of 5 1 7 amino 

acids (Figure 2). Xcdsl is most related to Chk2 (63% identical and 76% similar), the 
human homologue of Cdsl (Matsuoka et al, (1 998) supra, Brown et al, (1999) 
supra, Blasina et al y (1999) supra, Chaturvedi et al, (1999) Oncogene, 18, 4047- 
4054). The most conserved areas are the carboxyl-terminal kinase domain and 
amino-terminal forkhead-associated domain (Matsuoka et al., 1998). The extreme 
amino-terminal region of Xcdsl is less conserved at the primary sequence level, but is 
rich in SQ and TQ amino acid pairs, which are part of the consensus site for 
phosphorylation by the DNA-dependent protein kinase family (Anderson, (1993) 
supra). By using histidine-tagged Xcdsl (His6-Xcdsl) as antigen, polyclonal 
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antibodies against Xcdsl were generated. Affinity-purified anti-Xcdsl antibodies 
recognized a single protein of approximately 58 kd in Xenopus egg extracts. 
Antibodies against the carboxyl-terminal 15 amino acids of Xcdsl recognized a 
polypeptide of the same size. 

EXAMPLE 2 

Preparation of Various DNA Templates and Egg Extracts 

[0149] Ml 3 single-stranded DNA was prepared according to a protocol 

included in a mutagenesis kit from Amersham (Oligonucleotide-directed in vitro 
mutagenesis system version 2). The pBS plasmids were prepared according to an 
alkaline lysis protocol (Sambrook etaL y (1989) Mol. Cloning: a laboratory manual, 
Cold vSpring Harbor Laboratory Press, Cold Spring Harbor, NY). To generate 
linearized or open-circularized plasmids, ten microgram of plasmid pBS (Bluescript 
SK-phagemid 7 Stratagene) was either digested with restriction enzymes in the 
appropriate buffers under standard condition, or incubated at 56°C for 36 hours in TE 
buffer (10 mM Tris HC1, 1 mM EDTA, pH 8.0). The resulting linearized and open- 
circularized plasmids were then diluted with distilled water to a concentration of 0.2 
Hg/pl. Untreated PBS was also diluted accordingly. Oligonucleotides were 
synthesized at the DNA Synthesis Facility of California Institute of Technology 
(Caltech). Concentrated oligonucleotides were diluted with distilled water to a 
concentration of 2 pg/pl The sequences for the two complementary "randoms- 
sequence oligonucleotides are GACTCCGAGAACCAATTGC (Oligo 1; SEQ ED 
NO:3) and GCAATTGGTTCTCGGAGTC (Oligo 2; SEQ ED NO:4). The sequence 
for the oligonucleotide capable of forming a hairpin structure is 
G CGGCACG TTCT CGTGCCGC (Oligo 3; SEQ ID NO:5). Oligo 1 and Oligo 2 (10 
pg each) were annealed in 20 pi of HBS buffer (10 mM HEPES, pH 7.5, 150 mM 
NaCl) in a water bath (500 ml of water) which was kept at 70°C for 5 minutes and 
then cooled down at room temperature for 2.5 hrs. Oligo 3 (20 fig) was self-annealed 
by following the same procedure. To disrupt the duplex region, Oligo 3 was boiled in 
water for 10 minutes and then cooled on ice immediately. 

[01501 Xenopus egg extracts were prepared as described (Murray, (1991) 

Methods Cell Biol 36, 581-605) Extracts supplemented with 100 pg/ml of 
cycloheximide were incubated at 23°C for 90 minutes following addition of DNA to a 
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final concentration of either 10 ng/pl for plasmids and M13 DNA or 50 ng/pl for 
oligonucleotides. 

EXAMPLE 3 
Oligonucleotide Replication Assay 

[0151] Oligonucleotides were labeled at the 5 ? -end with T4 polynucleotide 

kinase. Briefly, four microgram of each oligonucleotide was incubated with 100 pCi 
of 32 P-ATP and 20 units of T4 polynucleotide kinase (New England Biolabs) in a total 
volume of 40 pi of reaction buffer (50 mM TrisCl, pH 7.5, 10 mM MgCl 2? 5 mM 
DTT). The reaction was performed at 37°C for 45 minutes. At the end of the 
reaction, 60 pi of distilled water was added to the reaction followed by a spin-column 
purification (Sephadex G-50) to remove the unincorporated 32 P-ATP. Ten microliter 
of the 32 P-labeled oligonucleotides (approximately 1 X 10 5 cpm/pl) was added to 
100 pi of interphase egg extract at room temperature. The first sample (10 pi of 
extract) was taken and frozen in liquid nitrogen immediately after addition of the 
oligonucleotides. An aliquot (10 pi) of extract was then taken and frozen every 15 
minutes. All samples were thawed on ice and diluted 20-fold with HBS buffer (10 
mM Hepes, 150 mM NaCl, pH 7.5). The diluted extract was deproteinized by 
extractions with an equal volume of phenol/chloroform (1:1) once and with 
chloroform three times. The protein- free sample (5 pi) was loaded onto a 10% native 
polyacrylamide gel for electrophoresis. The radiolabeled oligonucleotides were 
visualized by autoradiography. 

EXAMPLE 4 

Production of His6-Xcdsl and His6-Xcdsl-N324A Proteins from Bacteria. 

[0152] To introduce the N324A mutation into the Xcdsl gene, two pairs of 

primers were used to amplify the wild-type Xcdsl sequence: 
GGACGTCGGATCCTCTCGTGATACTAAAACAGAG (SEQ ED NO 1 1) and 
GGACTGGGTCGACGACAACAGCACAGCTTCAGGCTTCAG (SEQ ID NO; 12); 
GGACGTCCTCGAGTTATCTTTTTGCTCTCTTTTCGG (SEQ ID NO: 13) and 
GGTTGTCGTCGACTAGTGAAGAATGTTGCAT (SEQ ID NO: 14). The resulting 
PCR products were digested with BamW and Sail, and Sail and EcoRl, respectively, 
and ligated into the expression vector pET30(a)+ (Novagen) that had been digested 
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with BamHl and EcoRl. A three-fragment ligation produced a construct pET30(a)- 
Xcdsl-N324A. Both pET30(a)-Xcdsl-N324A and pET30(a)-Xcdsl (constructed as 
described herein for antibody production) were introduced into the K coli strain 
BL21(DE3)pLysS (Novagen). Expression of the His6-Xcdsl and His6-Xcdsl- 
N324A proteins was induced by growing the cells at 30°C for 3 hours in the presence 
of 0.4 mM IPTG in the medium. The proteins were purified using nickel agarose 
chromatography. 

EXAMPLE 5 
Kinase Assays 

10153] His6-Xcdsl or His6-Xcdsl-N324A proteins were incubated with GST- 

Cdc25(254-316)-WT or GST-Cdc25(254-316)-S287A in 20 pi of kinase buffer (10 
mM Tris-HCl, pH 7.5, 10 mM MgCl 2 > and 1 mM DTT) containing 2 pCi 32 P-ATP and 
10 pM ATP (Kumagai et al. y (1998)7. Cell BioL y 142, 1559-1569; herein 
incorporated by reference in its entirety). The reaction was performed at 23°C for 1 5 
minutes and terminated by adding 20 pi of gel loading buffer. The proteins were 
separated by SDS-PAGE and the phosphorylated proteins were detected using a 
Phosphorlmager (Molecular Dynamics). To detect the kinase activity of endogenous 
Xcdsl protein, interphase egg extracts either lacking or containing poly(dT) 40 were 
incubated at 23°C for 90 minutes. Next, the extracts were incubated with affinity- 
purified anti-Xcdsl antibodies bound to 10 pi of Affiprep protein A beads (Bio-Rad 
Laboratories) at 4°C for 60 minutes. After centrifugation, the extract supernatant was 
removed. The Affiprep protein A beads were washed three times with 1 ml of 10 mM 
Hepes, pH 7.5, 150 mM NaCl, 30 mM glycerophosphate, 0.1 mM NaV0 4 , 0.5% NP- 
40, 0.1% SDS, 0.1 mM phenylmethylsulfonyl fluoride, 10 pg/ml of leupeptin, 10 
pg/ml of chymostatin, and 10 pg/ml of pepstatin, and once with 1 ml of 10 mM Tris- 
HCl, pH 7.5, 10 mM MgCl 2 , and 0.1 mM phenylmethylsulfonyl fluoride. Kinase 
reactions were performed at 23°C for 15 minutes by incubating the 
immunoprecipitates in 20 pi of buffer containing 10 mM Tris-HCl, pH 7.5, 10 mM 
MgCl 2 , 1 mM DTT, 2 jaCi 32 P-ATP, 10 pM ATP, and 2 pg of GST-Cdc25(254-316)- 
WT. 
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EXAMPLE 6 

Xcdsl is Phosphorylated in Response to Double-stranded DNA Ends, 

J0I54] In fission yeast and human cells, Cdsl is phosphorylated and activated 

in response to DNA damage and/or replication blocks (Murakami and Okayama 
(1995) Nature 374, 817-819; Sanchez et al, (1996) Science 271, 357-360; Boddy et 
al, (1998) Science 280, 909-912; Lindsay et al, (1998) . Genes Dev. 12, 382-395; 
Matsuoka et al, (1998) Science 282, 893-897; Brown et al, (1999) Proc. Natl. Acad. 
Sci. USA 96, 3745-3750; Blasina etal, (1999) Curr Biol. 9, 1-10; Chaturvedi etal, 
(1999) Oncogene 75, 4047-4054; Brondello etal, (1999) Mol. Cell. Biol 19, 4262- 
4269). To test whether Xcdsl is similarly modified, the endogenous Xcdsl protein in 
the nuclear fraction of Xenopus egg extracts was examined by immunoblotting. 

[01551 Modifications of Xcdsl in response to M13 DNA and linearized 

plasmids were examined by immunoblot experiments. Interphase extracts containing 
2,000 sperm nuclei/pl, 2,000 sperm nuclei/^il and 100 ng/ml aphidicolin, or 2,000 
UV-damaged sperm nuclei/pl were incubated at 23°C for 100 minutes. Seventy 
microliter of each extract was centrifiiged through a sucrose cushion to isolate the 
nuclear fractions, which were subjected to SDS-PAGE and immunoblotting. After 
detection of Xcdsl, the immunoblot was stripped and probed with anti-Xchkl \ 
antibodies. Interphase egg extracts containing 10 ng/(il of M13 DNA or the same 
amount of Ml 3 DNA and 5 (xg/ml of actinomycin D were incubated at 23°C. An 
aliquot of extract (2 pi) was taken and frozen every 20 minutes after the addition of 
Ml 3 DNA. Xcdsl protein in each aliquot was then detected by immunoblotting. 
Extracts containing various DNA templates (lanes 1-6) or the same DNA templates 
and 5 mM caffeine (lanes 7-12) were incubated at 23 °C for 90 minutes and then 
analyzed for Xcdsl protein by immunoblotting. The immunoblot was subsequently 
stripped and probed for Xchkl protein. 

[0156] Xcdsl did not show a reduction in mobility during SDS- 

polyacrylamide gel electrophoresis (SDS-PAGE) when chromosomal DNA 
replication was blocked by the DNA polymerase inhibitor aphidicolin or when the 
nuclear DNA was damaged with UV. In contrast, as was previously reported and 
confirmed herein, Xenopus Chkl (Xchkl) was modified in response to both 
aphidicolin and UV (Kumagai et al, (1998) J. Cell Biol. 142, 1559-1569). 
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[0157] It has been reported that single- stranded Ml 3 DNA also elicits a 

checkpoint response in Xenopus egg extracts (Kornbluth et al y (1992) Mol. Cell. Biol. 
72, 3216-3223). While Xchkl mediates the response to aphidicolin and UV 
(Kumagai et aL, 1998, supra), it is unclear which effector kinase is regulated by Ml 3 
DNA. To examine this issue, single-stranded Ml 3 DNA was added to interphase 
extracts at a concentration (10 ng/|il) that delays mitosis substantially (Kornbluth et 
al., 1992). Incubation of the extracts with M13 DNA for 40 minutes or longer 
resulted in a dramatic decrease in the electrophoretic mobility of Xcdsl. This 
modification of Xcdsl was blocked by caffeine, an agent that overrides checkpoint 
Conversely, M13 DNA did not trigger the modification of Xchkl. 

10158] Although Ml 3 DNA was added to egg extracts in a single-stranded 

form, it is a very efficient template for DNA synthesis in such extracts. Within 40 
minutes, it is quantitatively converted to double-stranded DNA, which mainly consists 
of three forms: closed-circular, open-circular, and linearized (Mechali and Harland, 
1982). Significantly, the modification of Xcdsl did not occur until the M13 DNA had 
been incubated for 40-60 minutes in egg extracts, by which time the replication of this 
template has reached completion. Furthermore, inhibition of M13 DNA replication 
with actinomycin D (5 |ag/ml) prevented the modification of Xcdsl (Mechali and 
Harland, (1982) Cell 30, 93-101). Therefore, it is possible that a form of double- 
stranded DNA derived from M13, but not the single-stranded DNA itself, is the 
checkpoint signal. To test this hypothesis, closed-circular plasmid DNA, plasmids 
linearized by restriction enzymes, or open-circular plasmids generated by heating at 
56°C were added separately to the extracts (Hofferer et a/., (1995) Nuc. Acid. Res. 23 y 
1396-1397). Plasmids that had been digested with three different restriction enzymes, 
Hindltt, Sma\ 7 and Kpn\ which respectively produce 3 '-protruding, blunt, and 5'- 
protruding ends, strongly induced the modification of Xcdsl . In contrast, closed- 
circular or open-circular plasmids did not have any effect. The modification of Xcdsl 
induced by linearized plasmids was also abolished by caffeine. 

[0159] Unlike single- stranded DNA, double-stranded plasmid DNA is not 

efficiently replicated in whole egg extracts. Its metabolism in the extracts is not quite 
clear, except that it has been reported that linearized plasmids are partially re- 
circularized and multimerized by non-homologous end joining (Labhart, (1999) Mol. 
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Cell. Biol. 19, 2585-2593). Nonetheless, the linearized plasmids appeared to mimic 
the presence of damaged DNA with double-stranded breaks, which induced the 
modification of Xcdsl in the extracts. 

[0160] To evaluate further the nature of the checkpoint signal, other defined 

DNA molecules, i.e., four types of DNA homopolymers, each 40 nucleotides long, 
were added to the extracts. Extracts containing DNA homopolymers or the same 
DNA templates and 5 mM caffeine were incubated at 23 °C for 90 minutes and 
analyzed for Xcdsl protein by immunoblotting. The immunpblot was stripped and 
probed for Xchkl protein. Radiolabeled poly(dT) 4 o, and poly(dC) 40 , were incubated 
with 100 pi of interphase extract. Just after the addition of the homopolymers 
(marked as 0 min) and every 15 minutes (15-90 min) afterwards, a 10 pi sample was 
taken and deproteinized. The homopolymers were detected by autoradiography after 
native polyacrylamide gel electrophoresis. Interphase egg extracts containing 50 
ng/pl of oligonucleotide duplex (oligo 1 + oligo 2) were incubated at 23°C. An 
aliquot of extract (2 pi) was taken and frozen every 10 minutes after the addition of 
oligonucleotides. Xcdsl protein in each aliquot was detected by immunoblotting. 
Xcdsl protein immunoprecipitated from extracts containing 50 ng/pl of poly(dT) 4 o 
was treated with either lambda phosphatase or buffer for 60 minutes or for 100 
minutes at 30°C As a control, Xcdsl was also immunoprecipitated from untreated 
interphase extracts. Xcdsl protein in the immunoprecipitates was examined by 
immunoblotting. 

10161] Interestingly, modification of Xcdsl occurred only in the extracts 

containing poly(dT) 40 , and this modification was likewise sensitive to caffeine. 
Significantly, as is the case for Ml 3 DNA, linearized plasmids and poly(dT) 40 did not 
induce the modification of Xchkl under these conditions. 

[0162] Why did poly(dT) 40 but not the homopolymers poly(dC) 4 o> poly(dA) 40 , 

and poly(dG) 4 o induce the modification of Xcdsl? One plausible reason is that 
poly(dT) 40 could be converted to double-stranded DNA by replication in the egg 
extract, while the others remained as single-stranded oligonucleotides. Indeed, the 
eukaryotic DNA primase-polymerase a complex initiates RNA primer and DNA 
synthesis efficiently on polypyrimidinic templates in vitro (Gronostajski et al. 7 (1984) 
J. Biol. Chem. 259 y 9479-9484; Grosse and Krauss, (1985) J. Biol. Chem. 260, 1881- 
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1888; Yamaguchi et al 7 (1985) J. Biol Chem 260, 6254-6262; Yamaguchi et al. y 
(1985) Mol. Cell. Biol. J, 1 1704183), whereas primer synthesis on the polypurines 
poly(dG) and poly(dA) is negligible. Xenopus egg extracts contain a high 
concentration of ATP (>1 mM), which may both promote primer synthesis on 
poly(dT) and suppress formation of primers on poly(dC) (Yamaguchi et aL, 1985, 
supra). One approach to test this hypothesis would be to examine the electrophoretic 
mobility of the homopolymers in a native polyacrylamide gel after incubation in egg 
extracts, since homopolymer duplexes usually migrate faster than the single-stranded 
ones. Experiments confirm that a poly(dT)4o :poly(dA)4o duplex showed an increased 
mobility in a native gel compared to single-stranded poly(dT) 4 o- Poly(dT) 4 o and the 
other homopolymers were labeled at the 5' end with 32 P by using T4 polynucleotide 
kinase, and subsequently incubated in interphase egg extracts for 90 minutes. Most of 
poly(dT)4o, was converted within 15 minutes to a form that migrated faster in the gel 
at the position of a duplex. This conversion did not occur in the case of poly(dC)4o, 
poly(dG) 40? or poly(dA) 4 o- In agreement with these observations, it was found that an 
oligonucleotide duplex (19 base pairs) preformed from two complementary 
oligonucleotides with "random" sequences, and an oligonucleotide containing a 
hairpin structure both elicited the modification of Xcdsl. In these experiments, 
double- stranded DNA ends were required for the modification of Xcdsl, because each 
of the two "randoms-sequence oligonucleotides by themselves did not have this 
effect, and because disruption of the hairpin structure by heating the oligonucleotide 
abolished its signaling capacity. The response of Xcdsl to the preformed 
oligonucleotide duplex occurred more quickly (within 20 minutes) than was the case 
for Ml 3 DNA, which would be consistent with the notion that the duplex does not 
need to undergo replication to trigger a response from Xcdsl . 

[0163] Collectively, these findings strongly suggest that double-stranded DNA 

ends are the signal that triggers the modification of Xcdsl. This modification 
represents phosphorylation because treatment of the modified Xcdsl with lambda 
phosphatase reversed the mobility alteration. Interestingly, the signaling from double- 
stranded DNA to Xcdsl is a process that does not require membranes or an intact 
nuclear environment, since the phosphorylation of Xcdsl in response to poly(dT) 4 o 
occurred in membrane-free egg cytosol that was incubated under the same conditions 
as whole egg extracts. 
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EXAMPLE 7 

Xcdsl is Activated by the Addition of Poly(dT)40 to Egg Extracts 

[0164] It has been shown that fission yeast Cdsl and human Chk2/Cdsl 

phosphorylate Cdc25 within a 14-3-3 binding site (Zeng et al y (1998) Nature 395, 
507-5 10; Matsuoka et al\ (1998) supra, Brown et al, (1999) supra; Blasina et al, 
(1 999) supra). The binding of 14-3-3 proteins to Cdc25 is required for a normal . 
checkpoint response in humans, Xenopus egg extracts, and fission yeast (Peng et al, 
(1997) Science 277, 1501-1505; Kumagai et al, (1998), supra, Zeng et al, (1998)). 
Therefore, experiments were conducted to test whether Xcdsl would phosphorylate 
Xenopus Cdc25 in vitro. 

[0165] His6-Xcdsl and His6-Xcdsl-N324A were purified from E. coli and 

incubated with GST-Cdc25 [254-31 6]- WT or GST-Cdc25[254-316]-S287A in kinase 
buffer containing 32 P-ATP. The proteins were separated by SDS-PAGE and stained 
with Coomassie blue. The phosphorylated proteins were detected by using a 
Phosphorlmager. Immunoprecipitation was performed from interphase egg cytosol 
containing poly(dT) 4 o or no DNA using anti-Xcdsl antibodies or non-specific rabbit 
IgG. One third of each immunoprecipitate was analyzed for the modification of 
Xcdsl by immunoblotting; two thirds of each immunoprecipitate was incubated with 
GST-Cdc25[254-3 16]-WT to measure kinase activity 

[0166] His6-Xcdsl, but not a catalytically inactive mutant (His6-Xcdsl- 

N324A), phosphorylated a 62-amino acid region of Xenopus Cdc25 fused to 
glutathione S-transferase (GST-Cdc25 [254-3 16]-WT) (Kumagai et al, (1998) supra). 
The phosphorylation occurs on Ser-287 in the 14-3-3 binding site of Xenopus Cdc25, 
because a serine-to-alanine mutation at this position abolished the phosphorylation. 
Similarly, endogenous Xcdsl immunoprecipitated from egg extracts phosphorylated 
this substrate well. Moreover, the hyperphosphorylated Xcdsl protein 
immunoprecipitated from extracts containing poly(dT)4 0 showed a five- to six-fold 
increase over background in its kinase activity towards GST-Cdc25 [254-3 16]-WT, 
indicating that this simple template also triggers the activation of Xcdsl. 
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EXAMPLE 8 

DNA Templates with Double-stranded DNA Ends Delay Mitosis, 

[0167] Single-stranded Ml 3 DNA delays mitosis in cycling egg extracts, an 

effect that can be overridden by the base analogue caffeine (Kornbluth et ai 7 (1992) 
supra). M13 DNA, as well as other simple DNA molecules that either contain or 
generate double-stranded ends, promote the phosphorylation of Xcdsl . It is possible 
that Ml 3 DNA delays mitosis by activating Xcdsl . If this were the case, the other 
simple DNA molecules, such as poIy(dT) 40 , linearized plasmids, and oligonucleotide 
duplexes, might also delay mitosis because of their capacity to activate Xcdsl. 

|0168] The response of Xenopus egg extracts to poly(dT)4o in the presence and 

absence of Xcdsl was assessed. Poly(dT)4o or poly(dG) 4 o was added to the extracts 
at a final concentration of 50 ng/pl in the presence or absence of 5 mM caffeine. The 
extracts were activated with CaCl 2 before the addition of DNA. Sperm nuclei (200 
nuclei/|il) were added to the extracts in order to monitor the timing of nuclear 
envelope breakdown (NEB) by microscopy. The effect of removal of Xcdsl and 
Xchkl proteins from egg extracts by immunodepletion was also examined. M-phase 
extract (100 pi) was incubated with a mixture of anti-Xcdsl antibodies (20 pg) and 
anti-Xchkl antibodies (10 pg) bound to Affiprep protein A beads for 50 minutes at 
4°C with constant rocking. Protein A beads were removed by centrifugation. A 
second round of depletion was then performed to ensure that both Xcdsl and Xchkl 
were completely removed, which was assessed by immunoblotting. As a control, M- 
phase extracts were treated by the same procedure with non-specific rabbit IgG. For 
depletion of Xcdsl alone, anti-Xchkl antibodies were omitted. Depletion of Xcdsl 
does not diminish the mitotic delay caused by poly(dT) 40 . Sperm nuclei (500 
nuclei/pl) or both sperm nuclei (500 nuclei/pl) and poly(dT) 4 o (50 ng/pl) were added 
to Xcdsl -depleted or mock-depleted extracts. 

[0169] Significantly, poly(dT) 40 was found to greatly delay mitosis in egg 

extracts as did M13 DNA and an oligonucleotide duplex. The delay of mitosis by 
poIy(dT) 4 o was partially but not completely reversed by caffeine. This observation is 
reminiscent of the fact that the aphidicolin-induced checkpoint in Xenopus egg 
extracts involves both caffeine-sensitive and caffeine-insensitive pathways (Kumagai 
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et al., (1998), supra). In contrast, poly(dG) 4 o, which is not converted to a double- 
stranded form, did not significantly delay the timing of mitosis. 

EXAMPLE 9 

Immunodepletion of Xchkl and Xcdsl From Egg Extracts Does Not 
Compromise the Mitotic Delay Induced by DNA Molecules with 
Double-stranded Ends 

[01 70J To test whether Xcdsl mediates the mitotic delay induced by 

poly(dT)4o, Xcdsl was completely removed from egg extracts by immunodepletion. 

[0171] One hundred microliter of M phase extract was incubated with either 

20 jig of affinity-purified anti-Xcdsl antibodies, 10 |ig of affinity-purified anti-Xchkl 
antibodies, or both bound to 10 pi of Affiprep protein A beads (Bio-Rad Laboratories) 
at 4°C for 50 minutes. The same amount of control rabbit IgG (Zymed Laboratories 
Inc.) was used for mock-depletion. After the incubation, the beads were removed by 
centrifugation. The supernatants were treated again under the same conditions to 
ensure that Xcdsl and/or Xchkl were quantitatively removed from extracts. 

[0172] The response of Xenopus egg extracts to an oligonucleotide duplex and 

Ml 3 DNA in the presence and absence of Xcdsl was also examined. Sperm nuclei 
(500 nuclei/|il) or both sperm nuclei (500 nuclei/pl) and oligonucleotide duplex (oligo 
1 + oligo 2) (50 ng/pl) were added to Xcdsl -depleted or mock-depleted extracts. 
Sperm nuclei (500 nuclei/pi) were also added to untreated extracts. The timing of 
nuclear envelope breakdown (NEB) was monitored by microscopy. 

[0173] Interestingly, egg extracts lacking Xcdsl still displayed an intact 

mitotic delay in response to poly(dT)^o. Furthermore, depletion of Xcdsl did not 
compromise the delay of mitosis induced by Ml 3 DNA or a double- stranded 
oligonucleotide. The fact that egg extracts containing double-stranded DNA ends 
arrest efficiently in the absence of Xcdsl could have several explanations. For 
example, it is possible that Xchkl could compensate for the absence of Xcdsl . 
Indeed, in fission yeast, Chkl is mainly required for the damage checkpoint, but can 
substitute for Cdsl in the replication checkpoint when Cdsl is missing (Murakami 
and Okayama, (1995) supra; Lindsay et aL 7 (1998) supra, Boddy et aL y supra (1998); 
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Brondello et aL y (1999) supra). However, egg extracts lacking both Xcdsl and 
Xchkl still underwent a mitotic delay in the presence of poly(dT) 40 . Consistent with 
this observation, Xchkl did not undergo checkpoint-associated phosphorylation in the 
presence of double-stranded DNA ends, even when Xcdsl had been removed . 

EXAMPLE 10 

Kinase Activity Towards Ser-287 Of Xcdc25 Remains In Egg Extracts Lacking 

Xcdsl and/or Xchkl 

[0174] The assay for kinase activity toward Ser-287 of Xenopus Cdc25C in 

egg extracts was performed as described before (Kumagai et aL, 1998, supra). To 
block chromosomal DNA replication, aphidicolin (dissolved in DMSO at 10 mg/ml) 
was added to egg extracts to a final concentration of 100 pg/ml. 

[0175] Since Xchkl does not undergo checkpoint-associated phosphorylation 

in the presence of double- stranded DNA ends even when Xcdsl has been removed, 
there could be a distinct checkpoint pathway(s) that acts independently of Xcdsl and 
Xchkl . This pathway could either be normally activated by the presence of double- 
stranded DNA ends or perhaps be activated when Xcdsl is absent. It seems plausible 
that vertebrates such as Xenopus could have checkpoint systems that are more 
complex than those of lower eukaryotes. To address this possibility, total kinase 
activity toward the substrate GST-Cdc25 [254-3 16]-WT was examined in egg extracts 
lacking Xchkl, Xcdsl , or both. 

[0176] M-phase extracts were immunodepleted with either anti-Xcdsl 

antibodies, anti-Xchkl antibodies, both anti-Xcdsl and anti-Xchkl antibodies, or non- 
specific rabbit IgG. The treated extracts were activated with CaCl 2 and incubated at 
23 °C for 90 minutes in the presence or absence of poly(dT) 4 o- Finally, total kinase 
activity towards GST~Cdc25[254-316]-WT or GST-Cdc25[254-316]-S287A was 
assayed in each sample. Only kinase activity toward GST-Cdc25 [254-3 16]-WT was 
quantitated, since phosphorylation of the S287A mutant was negligible. 



[0177] Approximately 70% of the kinase activity toward Ser-287 of Cdc25 

remained in egg extracts when both Xcdsl and Xchkl were removed. 
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EXAMPLE 10 

[0178] The biochemical properties of Xcdsl, ?l Xenopus homologue of the 

checkpoint kinase Cdsl were analyzed. Collectively, these data make a strong case 
that templates containing double-stranded DNA ends trigger the phosphorylation of 
Xcdsl . Nonetheless, the exact chemical nature of the DNA structure that is being 
recognized remains to be elucidated. Furthermore, poly(dT)4o, also elicited an 
increased kinase activity of Xcdsl toward Ser-287 in the 14-3-3 binding site of 
Xenopus Cdc25. Other templates are also likely to activate Xcdsl . Since double- 
stranded DNA ends would typically be formed upon exposure to DNA damaging 
agents such as ionizing radiation, Xcdsl may normally be activated in response to this 
type of DNA damage in Xenopus cells. 

[0179] Interestingly, the response of Xcdsl to DNA templates is quite distinct 

from that of Xchkl, a Xenopus homologue of Chkl. As described previously, Xchkl 
undergoes a marked phosphorylation in egg extracts when replication is blocked by 
the DNA polymerase inhibitor aphidicolin or when UV-damaged sperm chromatin is 
added to the extracts (Kumagai et aL, (1998) supra). In contrast, aphidicolin and UV 
damage do not elicit phosphorylation of Xcdsl, as monitored by one-dimensional 
SDS-PAGE. Conversely, Ml 3 DNA^ linearized plasmids, double-stranded 
oligonucleotides, and poly(dT) 4 o, each of which caused a strong phosphorylation of 
Xcdsl , did not have a detectable effect on Xchkl . It appears that Xcdsl and Xchkl 
respond to quite different signals from DNA in the egg extract system. As shown 
herein, Xcdsl responds to double-stranded DNA ends, but the signal that leads to 
modification of Xchkl is not known. In the presence of aphidicolin, the initial firing 
of replication origins can proceed but replication is blocked after the priming stage 
(Mahbubani et al y (1997) J. Cell. Biol. 136, 125-135), suggesting that some aspect of 
stalled DNA replication forks triggers the phosphorylation of Xchkl . UV radiation, 
which causes the formation of pyrimidine dimers, is also a very efficient signal for the 
modification of Xchkl. The recognition, excision, and/or repair of pyrimidine dimers 
by repair/replication factors in egg extracts could also lead to the modification of 
Xchkl . Furthermore, it is important to note that, at the doses of UV that were used, 
replication of UV-treated sperm chromatin in Xenopus egg extracts is strongly 
impaired. Thus, it is possible that aphidicolin and UV radiation both cause 
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accumulation of a similar DNA structure, which may in turn lead to the modification 
ofXchkl. 

[0180] Although Cdsl and Chkl have been well conserved throughout 

evolution, the respective roles of these kinases in various checkpoint responses appear 
to vary depending on the species. In budding yeast, the closest homologue of Cdsl 
(Rad53) responds to both DNA damage (induced by exposure to methylmethane 
sulfonate) and DNA replication blocks (induced by treatment with hydroxyurea), 
although the response to hydroxyurea was less pronounced (Sun et al. y (1996) Genes 
Dev. 10 y 395-406; Sanchez et al y (1996) Science 277, 1497-1501. In wild-type 
fission yeast cells, Chkl responds to DNA damage (induced by ionizing radiation, 
methylmethane sulfonate, or UV light), but not hydroxyurea treatment (Walworth and 
Bernards, (1996) Science 277, 353-356. However, in fission yeast mutants lacking 
Cdsl, Chkl does respond to hydroxyurea (Lindsay et al. y (1998) supra; Boddy et al. y 
(1998) supra; Brondello et al. y (1999) supra. Although this finding may suggest that 
fission yeast Chkl can also respond to DNA replication blocks, it has been proposed 
that Cdsl is required to suppress DNA damage in hydroxyurea-treated cells. Finally, 
fission yeast Cdsl responds strongly to treatment with hydroxyurea. Fission yeast 
Cdsl can also be activated in response to DNA damage, but its responsiveness to 
damage is restricted to S-phase (Murakami and Okayama, (1995) supra; Lindsay et 
al y (1998) supra; Brondello et al y (1999) supra). Collectively, the available data 
indicate that fission yeast Cdsl is activated by DNA replication blocks and exposure 
to damaging agents during S-phase, which may indirectly induce replication blocks. 
In contrast, Chkl responds mainly to DNA damage and plays a more specialized role 
in the response to hydroxyurea. Thus, there appear to be significant differences 
between fission yeast and Xenopus egg extracts with respect to how Cdsl and Chkl 
respond to the DNA structures that trigger checkpoints. 

|0181] In human cells, Chk2, a Cdsl homologue, is activated upon exposure 

to ionizing radiation, UV light, and hydroxyurea, but the response to ionizing 
radiation is the strongest (Matsuoka et al y (1998) supra; Brown et al. y (1999) supra. 
Significantly, the checkpoint kinase ATM, a member of the DNA-dependent protein 
kinase family that is defective in individuals with ataxia telangiectasia (AT), has been 
implicated as an upstream regulator of human Chk2/Cdsl (Matsuoka et al. y (1998) 
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supra; Brown et ah, (1 999) supra). AT cells are very sensitive to ionizing radiation, 
but display a normal DNA replication checkpoint (Cliby et ah, (1998) EMBO J. 17, 
159-168). Thus, although human Chk2/Cdsl appears to respond to a wider variety of 
signals than Xcdsl, human Chk2/Cdsl is similar to Xcdsl in that it is clearly involved 
in a pathway that responds to ionizing radiation/double-stranded DNA breaks. 
Relatively less has been reported about the signals to which human Chkl responds. 
Sanchez et aL ((1997) supra) found that that human Chkl is modified in response to 
ionizing radiation, but the extent of modification was much less than what has been 
observed for human Chk2/Cdsl (Matsuoka et al y (1998) supra). The effect of 
hydroxyurea or other agents that induce DNA replication blocks on human Chkl has 
not been reported. Accordingly, the role of human Chkl in the DNA replication 
checkpoint is unclear at this time. Significantly, the Drosophila homologue of Chkl 
(Grapes) has also been implicated in the replication checkpoint in fly embryos 
(Fogarty et al, (1997) Curr. Biol. 7, 418-426; Sibon et al, (1997) Nature 388, 93-97), 
suggesting that Xchkl and at least one other metazoan Chkl homologue may fulfill 
similar functions. 

[01 82] It was shown previously that the cell cycle delay in Xenopus egg 

extracts that is induced in response to treatment with aphidicolin involves multiple 
pathways: a caffeine-sensitive pathway containing Xchkl and a caffeine-insensitive 
pathway. In view of the findings reported herein, Xcdsl is not a viable candidate for 
an effector of the caffeine-insensitive component of the aphidicolin-induced 
checkpoint for a number of reasons. First, Xcdsl is not phosphorylated in response to 
aphidicolin. Second, the phosphorylation of Xcdsl by the presence of double- 
stranded DNA ends in egg extracts is completely abolished by caffeine. Finally, 
immunodepletion of both Xcdsl and Xchkl from egg extracts did not further 
compromise the aphidicolin-induced checkpoint in relation to extracts from which 
Xchkl alone had been removed. 

[0183] The pathways that control the response of egg extracts to double- 

stranded DNA ends may be even more complex than those that are triggered by DNA 
replication blocks. As is the case for aphidicolin, the delay of the cell cycle induced 
by poly(dT) 4 o is only partially abrogated by treatment with caffeine. Thus, the cell 
cycle delay in response to poly(dT) 40 also appears to involve both caffeine-insensitive 
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and caffeine-sensitive pathways. The nature of the caffeine-insensitive pathway(s) 
that is triggered by poly(dT) 40 is unknown. Likewise, it is unclear whether 
aphidicolin and po!y(dT) 40 trigger the same or distinct caffeine-insensitive pathways. 

[0184] Since the phosphorylation/actiyation of Xcdsl in response to 

poly(dT)4o is abolished by caffeine, Xcdsl would be a candidate for a component of 
the caffeine-sensitive pathway(s). However, immunodepletion of Xcdsl does not 
compromise the cell cycle delay that is induced by poly(dT) 40 . Similar results were 
obtained when Ml 3 DNA or a double- stranded oligonucleotide were used to delay the 
cell cycle. It is possible that Xcdsl becomes phosphorylated due to the presence of 
various DNA templates, but does not play a role in the cell cycle delay that is 
triggered by them. This explanation would be somewhat unexpected in that Xcdsl 
does phosphorylate Xenopus Cdc25 well on Ser-287 in its 14-3-3 binding site. Cdc25 
has been implicated as a target of the DNA replication/damage checkpoints in fission 
yeast, humans, and Xenopus egg extracts (Furnari et aL, (1997) Science 277, 1495- 
1497; Sanchez et al y (1997) supra; Peng et aL y (1997) supra, (Kumagai^/ a/., (1998) 
supra), although an additional target(s) such as Weel could also be involved 
(O'Connell et a/., (1997) EMBO J. 16, 545-554). On the other hand, Lindsay et al 
((1998) supra) have proposed that fission yeast Cdsl may play a distinct role in 
regulating the formation or stabilization of replication structures. If the function of 
Xcdsl does not involve delaying the cell cycle, the implication would be that there is 
another factor(s) in egg extracts that carries out this role in response to double- 
stranded DNA ends. 

[01 85) Another type of explanation for the intact checkpoint delay in Xcdsl - 

depleted extracts is that another factor in these extracts acts redundantly with Xcdsl . 
This putative factor would not appear to be Xchkl, because immunodepletion of 
Xchkl alone or in combination with Xcdsl did not compromise the cell cycle delay in 
response to poly(dT) 4 o- However, approximately 70% of the total kinase activity 
towards Ser-287 of Cdc25 remains behind in egg extracts lacking both Xcdsl and 
Xchkl . It will be necessary to identify this Ser-287-specific kinase in order to 
evaluate the possibility that it contributes to cell cycle regulation in response to 
double-stranded DNA ends. The kinase(s) that lies upstream of Xcdsl in the 
pathways triggered by double-stranded DNA ends is currently unknown. In the 
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human system, convincing evidence has been presented that ATM is an upstream 
regulator of human Chk2/Cdsl (Mutsuoka et al 9 (1 998) supra; Brown et al. 7 (1999) 
supra). AT ceils are hypersensitive to ionizing radiation (Meyn, (1995) Cancer Res. 
55, 5991-6061), which suggests that ATM is involved in mediating the response to 
double-stranded DNA ends. The recently described Xenopus homologue of ATM 
would be a good candidate for a regulator of Xcdsl (Robertson et ai, (1999) 
Oncogene 18, 7070-7079). Since the agents to which Xchkl responds are quite 
different from those which activate Xcdsl, the implication is that ATM may not be 
the principal upstream regulator of Xchkl in the replication checkpoint response. 

[0186] It is to be understood that while the invention has been described in 

conjunction with the detailed description thereof, the foregoing description is intended 
to illustrate and not limit the scope of the invention, which is defined by the scope of 
the appended claims. Other aspects, advantages, and modifications are within the 
scope of the following claims. 



